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5! © Novel antigens and methods for their preparation. 

© A composite particle comprising at least two polypeptides corresponding to all or part of a protein having the 
W biological activity of one of the hepatitis B surface antigens, wherein the particle presents at least two antigenic 
^ determinants provided by the S-protein, preS2-protein or preS1 -protein, said particle optionally further compris- 
r- ing host specific lipids. The invention also provides a modified hepatitis B surface antigen L protein which may 
^ be used alone or incorporated into such composite particles. 

© A particular embodiment of the modified L protein (L*) comprises residues 12-52 followed by residues 133- 
A 145 followed by residues 175-400 of the L protein and a particular embodiment of a composite particle has the 
y] form (L*, S) where S is the S-protein of HBsAg. 

Improved hepatitis B vaccines may be prepared following expression of the above immunogens, especially 
in yeasts such as S. cerevisiae and Hansenula polymorpha . 
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NOVEL ANTIGENS AND METHODS THEREFOR 



The invention relates to a microorganism selected from the group of methylotrophic yeast genera, a 
DNA molecule containing an expression cassette, a vector suitable for transformation of a methylotrophic 
yeast, a process for the preparation of a microorganism, a composite particle containing at least two 
polypeptides corresponding to all or part of a protein having the biological activity of hepatitis B surface 
5 antigen, a method for the preparation of composite particles and to compositions, vaccines and test kits 
containing same. 

The invention also relates to the construction of microorganism strains synthesizing modified large 
surface proteins of hepatitis B virus, especially hepatitis B surface L antigens which are modified at various 
regions of the coding seguence to alter and enhance the biological activity thereof. 

ro Hepatitis is a wide spread, serious infectious disease, which may be caused by several agents. One 
important cause was identified as hepatitis B virus (HBV), a small virion of about 43 nm diameter containing 
a genome which consists of a 3,200 base pair double-strained DNA circle having a single stranded gap of 
variable length. Infection with this virion results in an acute, sometimes fatal disease. Eighty five percent of 
affected persons recover after about three months of illness, about 1% suffers from acute necrosis of liver 

rs tissue, resulting in death within a short period, and about 10% suffer from a recurring outbreak of the 
disease. Approximately 4% of infected persons develop a chronic carrier state with an increased risk of 
primary hepatocellular carcinoma or liver cirrhosis. 

Infection mostly occurs via perinatal or parenteral transmission of infectious virions, for example in the 
course of blood transfusions or by use of contaminated injection syringes or tips, or by sexual contact 

20 Therefore there is an urgent need for reliable diagnostic means for detection of HBV and for a vaccine 
protecting people at high risk of exposure, such as spouses of chronic carriers, travellers to areas of high 
HBV endemicity, newborns of chronic carriers, homosexuals, prostitutes and drug abusers, which should be 
affordable at low costs. Furthermore in third world countries there exists a need for an inexpensive vaccine 
for mass immunisation programmes. 

25 In addition to viral particles consisting of the core (HBcAg) and capsid or surface (HBsAg) antigen 
proteins enclosing the circular, partially double stranded DNA genome the blood of infected individuals may 
contain substantial quantities of HBsAg particles containing the surface antigen protein and host lipids. 
These particles have been purified from human serum and formulated as vaccines which have been shown 
to confer protection against HBV infection (Szmuness et al, 1980). 

30 In this application certain publications are referenced by mention of the first author and the year of 
publication within parenthesis. Full citation for these references listed according to their alphabetical order 
may be found at the end of the specification. The disclosures of these publications in their entireties are 
hereby incorporated by reference into this application in order to more fully describe the state of the art. 
The preparation of vaccines from serum derived HBsAg has been hampered by the limited supply of 

35 infected blood and by the need to undergo long and rigorous testing to ensure the removal and/or 
inactivation of potential infectious agents. 

Hepatitis B virus (HBV) infection in humans is associated with the occurrence in the serum of various 
structures carrying the hepatitis B surface antigen (HBsAg) [Tiollais et al, Nature , 17 ;489 (1985)]. In 
addition to infectious virions, filamentous and spherical particles of 22 nm in diameter (containing about 100 

40 envelope proteins) are present which are formed by association of host-derived lipids with the three 
hepatitis surface proteins: the major (S), middle (M) and large (L) proteins. 

These proteins share the same seguence of 226 amino acids codons on the HBV genome, known as 
the S protein coding seguence. The entire amino acid coding seguence which immediately precedes the S 
protein coding sequence on the HBV genome is referred to herein as the preS coding sequence. The preS 

45 coding sequence codes for a 55 amino acid sequence which immediately precedes the S protein, called the 
preS2 region, and, depending on the virus subtype, either a 108 or 119 amino acid sequence which 
immediately precedes the preS2 region, called the preS1 region. 

The entire preS coding sequence thus codes for 163 (55 preS2 + 108 preS1) amino acids in ay 
subtypes and for 174 (55 preS2 + 119 preS1) amino acids in most ad subtypes. Sequence comparisons 

so between the preS coding sequences in the genomes of ad and ay isolates have shown that the first eleven 
codons of the preS1 region of ad isolates are not present in ay subtypes. In agreement with the generally 
adopted nomenclature, throughout this application the codon and amino acid numbering of ad subtypes is 
followed, meaning that HBV genomes of ad isolates encode a preS region of 174 amino acids numbered 1 
to 174 while the 163 amino acids preS region of ay subtypes are numbered 12 to 174. 

The major protein (S) is encoded by the 226 amino acid codons S gene and exists in glycosylated and 
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non-glycosylated forms. 

The middle protein (M) includes the preS2 region and the S protein (M protein: 55 plus 226 amino 
acids). The M protein is a glycoprotein present in two forms according to the extent of glycosylation [Stibbe 
et al, J. Virol 46 ;626-628 (1983)]. The 55 amino acids encoded by the preS2 region are hydrophilic and 

5 contain an immunodominant epitope located at the surface of the envelope gene (residues 132-137). This 
epitope is disulfide bond-independent and reported to be more immunogenic than epitopes of the S protein. 
[Neurath et al, Science , 224 :392-394 (1984); Neurath et al., Nature , 315 :1 54-1 56 (1985)]. The preS2 
sequence also contains a receptor for polymerized human serum albumin (PHSA) [Machida et al, Gastroent 
86 :910-918 (1984)]. This receptor can also bind monomeric HSA and may mediate attachment of HBV to 

w hepatocytes, which also have a receptor for pHSA. It has been suggested that such binding could lead to 
tolerance or auto-immune reactions in humans. 

The large protein (L) includes the preS1 region, the preS2 region and the S coding sequence (L protein: 
108 (or 119) plus 55 plus 226 amino acids). It is present in glycosylated and non-glycosylated forms. The L 
protein is of variable length according to the subtype, e.g., 389 and 400 amino acids long for the ay and ad 

75 subtypes, respectively. The preS1 translation product is also involved in the attachment of HBV to 
hepatocytes. 

Because the promoter for the S and M specific transcripts is embedded within the open reading frame 
of the L protein, transformation of mammalian cells with DNA encoding the complete open reading frame for 
the L protein may result in synthesis of all three surface proteins. In mammalian cells, HBsAg particles are 

20 secreted [Tiollais et al, cited above]. In mammalian systems expression of L protein causes an inhibition of 
secretion of HBsAg particles. [See, e.g., Ou et al, J. Virol ., 61^ :782 (1987)]. This phenomenon has been 
suggested to be related to the presence on the L protein of an N-myristoyl group, causing interference with 
the budding or secretion process [Persing et al, Science , 234 :1 388-1 391 (1986); Persing et aj, JL Virol ., 61^ 
:1 672-1 677 (1987)]. - — 

25 For ease of reference, the following Table A sets out the DNA coding sequences and amino acid 
sequences for the preS1, preS2 and S regions of the HBV proteins discussed herein. Amino acid numbers 
are above the codons in parentheses and are numbered from the first ATG codon present in the open 
reading frame for the envelope proteins found in HBV adw isolate sequence which was cloned on plasmid 
pR!T10616. As the first eleven codons of this ad sequence are not present in any of the expression casettes 

30 described in the present application these sequences are not given in Table A. Plasmid pRIT10616, 
harboured in E. coli K12 strain C600, is on deposit at the American Type Culture Collection. Rockville, 
Maryland, U.SA under accession number ATCC 39131. 
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Table A 
PRE -SI REGION 

(12) 

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG GGA TTC TTT 
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe 

(26) 

CCC GAT CAT CAG TTG GAC CCT GCA TTC GGA GCC AAC TCA AAC 
Pro Asp His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn 

(40) 

AAT CCA GAT TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG CCA 
Asn Pro Asp Trp Asp Phe Asn Pro lie Lys Asp His Trp Pro 

(54). 

GCA GCC AAC CAG GTA GGA GTG GGA GCA TTC GGG CCA GGG CTC 
Ala Ala Asn Gin Val Gly Val Gly Ala Phe Gly Pro Gly Leu 

(68) 

ACC CCT CCA CAC GGC GGT ATT TTG GGG TGG AGC CCT CAG GCT 
Thr Pro Pro His Gly Gly lie Leu Gly Trp Ser Pro Gin Ala 

(82) 

CAG GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT CCT CCT GCC 
Gin Gly He Leu Thr Thr Val Ser Thr He Pro Pro Pro Ala 
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(96) 

TCC ACC AAT CGG CAG TCA GGA AGG CAG CCT ACT CCC ATC TCT 
Ser Thr Asn Arg Gin Ser Gly Arg Gin Pro Thr Pro lie Ser 

(110) (119) 
CCA CCT CTA AGA GAC AGT CAT CCT CAG GCC 
Pro Pro Leu Arg Asp Ser His Pro Gin Ala 

PRB-S2 REGION 

(120) 

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA GCT CTG CAG GAT 
Met Gin Trp Asn Ser Thr Ala Phe His Gin Ala Leu Gin Asp 

(134) 

CCC AGA GTC AGG GGT CTG TAT TTT CCT GCT GGT GGC TCC AGT 
Pro Arg Val Arg Gly Leu Tyr Phe Pro Ala Gly Gly Ser Ser 

(148) 

TCA GGA ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT CAC ATA 
Ser Gly Thr Val Asn Pro Ala Pro Asn lie Ala Ser His lie 

(162) (174) 
TCG TCA AGC TCC GCG AGG ACT GGG GAC CCT GTG ACG AAC 
Ser Ser Ser Ser Ala Arg Thr Gly Asp Pro Val Thr Asn 

S PROTEIN 

(175) 

ATG GAG AAC ATC ACA TCA GGA TTC CTA GGA CCC CTG CTC GTG 
Met Glu Asn He Thr Ser Gly Phe Leu Gly Pro Leu Leu Val 

TTA CAG GCG GGG TTT TTC TTG TTG ACA AGA ATC CTC ACA ATA 
Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg He Leu Thr He 



EP 0 414 374 A2 



CCG CAG AGT CTA GAG TCG TGG 
Pro Gin Ser Leu Asp Ser Trp 

GGG GGA TCA CCC GTG TGT CTT 
Gly Gly Ser Pro Val Cys Leu 

ACC TCC AAT CAC TCA CCA ACC 
Thr Ser Asn His Ser Pro Thr 

GGT TAT CGC 
Gly Tyr Arg 

TTC ATC CTG 
Phe lie Leu 

GAT TAT CAA GGT ATG TTG CCC 
Asp Tyr Gin Gly Met Leu Pro 

TCA ACA ACA ACC AAT ACG GGA 
Ser Thr Thr Thr Asn Thr Gly 

CCT GCT CAA GGC AAC TCT ATG 
Pro Ala Gin Gly Asn Ser Met 



TGG ACT TCT CTC AAT TTT CTA 
Trp Thr Ser Leu Asn Phe Leu 

GGC CAA AAT TCG CAG TCC CCA 
Gly Gin Asn Ser Gin Ser Pro 

TCC TGT CCT CCA ATT TGT CCT 
Ser Cys Pro Pro lie Cys Pro 



GTT TGT CCT CTA ATT CCA GGA 
Val Cys Pro Leu lie Pro Gly 

CCA TGC AAA ACC TGC ACG ACT 
Pro Cys Lys Thr Cys Thr Thr 

TTT CCC TCA TGT TGC TGT ACA 
Phe Pro Ser Cys Cys Cys Thr 



TGG ATG TGT CTG CGG CGT TTT ATC ATA TTC CTC 
Trp Met Cys Leu Arg Arg Phe lie lie Phe Leu 

CTG CTA TGC CTC ATC TTC TTA TTG GTT CTT CTG 
Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu 



AAA CCT ACG GAT GGA AAT TGC ACC TGT ATT CCC ATC CCA TCG 
Lys Pro Thr Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser 

TCC TGG GCT TTC GCA AAA TAG CTA TGG GAG TGG GCC TCA GTC 
Ser Trp Ala Phe Ala Lys Tyr Leu Trp Glu Trp Ala Ser Val 

CGT TTC TCT TGG CTC AGT TTA CTA GTG CCA TTT GTT CAG TGG 
Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe Val Gin Trp 
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TTC GTA GGG CTT TCC CCC ACT GTT TGG CTT TCA GCT ATA TGG 
Phe val Gly Leu Ser Pro Thr val Trp Leu Ser Ala He Trp 
ATG ATG TGG TAT TGG GGG CCA AGT CTG TAC AGC ATC GTG AGT 
Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Ser He val Ser 

CCC TTT ATA CCG CTG TTA CCA ATT TTC TTT TGT CTC TGG GTA 
Pro Phe lie Pro Leu Leu Pro He Phe Phe Cys Leu Trp Val 

(400) 
TAC ATT TAA 
Tyr He Stop 

Viral transcription initation signals located 5' to the translation initiation codons control the expression of 
each of the preS1, preS2 and S proteins in vivo . Upon translation in the mammalian cell, the proteins may 
be glycosylated, resulting in a set of surface antigens as demonstrated in the following table. 



Protein 


Length 


Designation 


Glycosylation of 


Glycosylation of 




(amino 




S-protein Sequences 


preS2-specific 




acids) 






domains 


P24 


226 


S-protein 






GP27 


226 


S-protein 


+ 




GP33 


281 


preS2-protein 




+ 


QP36 


281 


preS2-protein 


+ 


+ 


P39* 


389 


preS1 -protein 






GP42* 


389 


preS1 -protein 


+ 





*: Subtype ayw; the preS1 -proteins derived from adw-subtype are about 1.0 to 1.5 kDa 
larger, which is in agreement with an additional 11 amino acids at the N-terminal end 
(Heermann et al., 1984). 



Since the DNA sequence of the surface antigen coding region became known, several attempts have 
been made to produce S protein by virtue of recombinant DNA technology. Burrell et al.(1979) and Murray 
et al. (1980) as well as several other researchers reported the expression of HBsAg in E coii. Valenzueia et 
al. (1982) reported the expression of HBsAg in yeast. Following disruption of yeast cells transformed with a 
vector containing the S protein coding sequence under control of the yeast alcohol dehydrogenase I 
promoter, spherical particles containing HBsAg could be observed which are similar to those found in sera 
of individuals infected with HBV. In European Patent Application 0 226 846 (Tschopp et al.) the production 
of HBV S-protein in the yeast Pichia is disclosed. Protein synthesis is under the control of a regulatory 
region responsive to methanol. 

However, although vaccines containing exclusively S-antigen particles are generally highly protective 
(Szmuness et al., 1980) some hosts respond poorly to such preparations. To overcome this unpredictable 
failure of the vaccination, systems for expression of preS2-protein and preS1 -protein have been developed. 
PreS2 is known to be even more immunogenic than S-protein. For example, upon immunisation of mice 
with subviral particles bearing preS2-and S-specific epitopes, the antibody response to preS2 exceeds the 
anti-S-response (Miiich et al.. 1985). Furthermore it has been observed that immunisation with preS2 
containing particles may also induce an anti-S-response. The role of preS1- and preS2-proteins in 
immunisation and the specificity of the T- and B-cell recognition of HBV surface antigen proteins has 
recently been reviewed (Miiich, 1988. with references therein). Valenzueia et al. (1985) disclose the 
expression of the entire preS2- protein coding sequence in yeast transformed with a plasmid containing the 
corresponding viral sequences. It has to be noted, that in yeast transcription, initiation has to be directed by 
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yeast promoters preceding the respective coding region, since the viral transcription initiation signals within 
the HBV-ORF are not recognised by the yeast transcription system. 

Itoh et al. (1986) and others also disclose expression of the preS2-protein in yeast and assembly into 
particles. Dehoux et al. (1986) disclose expression of a 39 kD preS1 -protein in yeast and state that this is 
5 assembled into a particle form. A report by Imamura et al. (1987) discloses expression in S. cerevisiae of 
both the preS2-protein and the preS1 -protein. These authors state that the large protein was found as a 
relatively stable, non-giycosylated product of 42 kD molecular weight which was not readily assembled into 
particles. 

Giycosylation of the surface proteins produced in the yeast S. cerevisiae differs from glycosylation 

w observed in naturally occuring particles isolated from individuals infected with HBV. The S-protein is not 
glycosylated, whereas preS2-protein and preS1 -protein are produced in N-linked glycosylated and non- 
glycosylated forms. N-linked chains of the high mannose type were identified, as well as O-linked 
oligosaccharide chain(s) (Itoh et al. f 1986; Langley et al., 1988). 

Expression of HBV surface antigen in mammalian cells has also been performed. Michel et al. (1984) 

75 disclosed the synthesis of HBsAg in Chinese Hamster ovary (CHO) cells transfected with a plasmid carrying 
the preS2-protein coding sequence. Persing et al. (1985) disclosed the expression of three HBsAg related 
polypeptides of 24,000, 27,000 and 35,000 daltons in mouse L cells transformed with a preS2-protein 
coding sequence. The three polypeptides obtained may be organised into complex immuno reactive HBsAg 
particles of about 22 nm diameter. 

20 McLachlan et al. (W088/06185) reported on the expression of various HBV related proteins in vertebrate 
cells. However, the expression of preS-antigen in mammalian cells is generally hampered by the fact that 
due to relative overproduction of preS1 -protein relative to S-protein secretion of HBsAg particles is inhibited 
(Ou et al., 1987). Moreover the ratio of the different S related proteins cannot be controlled. Finally culturing 
of animal cells is an expensive approach which results in high costs for the product obtained. 

25 Although vaccines presently described and in use have great efficacy, a certain percentage of persons 
receiving such vaccines, particularly immunocompromised persons, e.g., hemodialysis patients, are non- or 
slow responders [See, e.g., Hadler et al, New Engl. J. Med. , 315 :209-215 (1986); and Bruguera et al, Post 
Grad . Med. J. , 63 :1 55-1 58 (1987)]. 

There thus remains a need in the art for methods and compositions useful in preparing additional 

30 effective vaccines to HBV. 

According to a first aspect of the present invention there is provided a microorganism selected from the 
group of methylotrophic yeast genera, carrying 

A) more than one copy of an expression cassette (ed) coding for a first polypeptide and/or one or more 
than one copy of an expression cassette (ec2) coding for a second polypeptide, optionally in addition 

35 one or more than one expression cassette (ec3) coding for a third polypeptide, or 

B) one copy of an expression cassette (ed) coding for a first polypeptide and one or more than one 
copy of an expression cassette (ec2) coding for a second polypeptide, optionally in addition one or more 
than one expression cassette (ec3) coding for a third polypeptide, wherein the expression cassettes 
comprise: 

40 a) a regulon R which is responsive to methanol and/or depletion of catabolite repressing carbon sources; 

b) an open reading frame coding for all or part (or parts) of a protein having the biological activity of one 
of the hepatitis B surface antigens; 

c) optionally a DNA sequence serving as a transcription terminator T; wherein R exerts control on the 
transcription of the open reading frame and T directs polyadenylation and/or termination of transcription 

45 of a produced mRNA. 

The advantage of the above aspect of the invention is that a means is provided for obtaining composite 
particles containing various ratios of pre S-and S-protein thereby allowing the large scale production of 
these composite particles at low costs. 

The present invention also provides a composite particle containing at least two polypeptides cor- 
50 responding to all or part of a protein having the biological activity of a hepatitis B surface antigen, wherein 
the particle presents at least two antigenic determinants provided by the S-protein, preS2-protein or preS1- 
protein, said particle optionally further comprising host specific lipids. 

The following terms used throughout the specification are defined as follows: 

An "expression cassette" is defined as any discrete region of DNA which functions in a host cell as a 
55 complete gene expression unit. 

A "complete gene expression unit" is a structural gene and the promoter and regulatory regions 
required for its transcription and translation. 

A "functional DNA coding region" means a DNA coding sequence which, when fused in phase to the S 
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protein coding sequence, does not interfere with the assembly of an HBsAg mixed particle. 

A "functional derivative" means a coding sequence having amino acid alterations which do not interfere 
with particle formation and which retain immunogenicity or other functionality. Such functional derivatives 
can be prepared by conventional site specific mutagenesis or by other standard techniques. 

A "vector" is defined as DNA which can carry and maintain the DNA fragment of the invention, 
including, for example, phages and plasmids. These terms are understood by those of skill in the art of 
genetic engineering. 

The invention, which comprises further subjects, is now described in a more detailed manner by the 
following description, examples and figures. 



Brief Description of the Figures: 
Figure 1: 

Construction of plasmid pME4 from pHARSI . The HARS1 fragment has been transferred to the single 
Pvull site of former pHARSI, the sequences between the former position of HARS and the beta- 
iactamase gene as well as the 5 end of the beta-lactamase gene itself have been deleted. 
Figure 2: 

Schematic representation of lambda-phages carrying genomic Hansenula polymorpha DNA fragments 
comprising 

a) the MOX gene 

b) the FMD gene. 
Figure 3: 

Map of plasmid pT-24. This plasmid contains a DNA fragment encompassing the terminator of the MOX 
gene cloned into the smal-site of the pUC19 multiple purpose cloning site. 
Figure 4: 

Maps of plasmid pRB-S-269 and pRB-S-271 expressing the S-gene under control of the MOX- or FMD- 
promoter, respectively. 
Figure 5: 

Maps of plasmids pBC f pMS-2, pFS-9. The latter two plasmids are derived from PBC containing the 
Hansenula polymorpha URA3 gene by insertion of expression cassettes containing the MOX promoter 
(pMS-2) or the FMD promoter (pFS-9) into the URA3 gene coding sequence. 
Figure 6: 

Map of plasmid pHK2. This plasmid contains a kanamycin gene (derived from Tn5) under control of the 
ADH1-promoter of Saccharomyces cerevisiae . In addition plasmid pHK2 comprises the HARS1 se- 
quence. 
Figure 7: 

Maps of plasmids pRB-S-322 and pRB-5-326. The functional unit consisting of the kanamycin resistance 
gene and ADH1 -promoter as present on plasmid pHK was inserted into the HARS regions of plasmid 
pRB-S-269 and pRB-S-271. 
Figure 8: 

Construction of pMPS-22 from pMOX-P/T1. The preS gene of Hepatitis B virus was inserted into the 
EcoRI site of plasmid pMOX-P/T1 . In the resulting plasmid pMPS-22 the preS gene is regulated by the 
MOX promoter. 
Figure 9: 

Map of plasmid pMPS-9. For the construction of this plasmid the preS expression cassette of pMPS-22 
was inserted between the Xhoi and Stul site of the Hansenula fragment of original plasmid pBC. 
Figure 10: 

Map of the Hansenula polymorpha URA3 gene and adjacent DNA-regions. 
Figure 11: 

A diagram illustrating the structure of plasmid pMPT121 in which foreign gene sequences can be 
inserted under the control of the MOX promoter. 
Figure 12: 

A diagram illustrating the structure of plasmid pKL*1 containing an expression cassette for L* protein 
under control of the MOX promoter. 
Rgure 13: 

A schematic diagram showing the steps for constructing an E. coli plasmid harbouring a Gly13 deleted L 
protein-expression cassette, pRIT12998. 
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Figure 14: 

A schematic diagram illustrating the procedure for obtaining a yeast plasmid pRIT12979 capable of 
expressing a Qly13 deleted L protein. 
Figure 15: 

5 A schematic diagram illustrating the construction of the piasmid pRIT13191 containing the coding 
sequence for an L protein deleted for amino acids 53-132 and 146-174. 
Figure 16 

A schematic diagram illustrating the construction of yeast expression plasmids pRIT13192 and 
pRIT13193 containing the expression cassettes for the L" and AGly L* proteins respectively. 
io ' The microorganism according to the first aspect of the invention may be any member of a 
methylotrophic yeast genus, which has been manipulated to carry more than one copy of an expression 
cassette ec1 and optionally one or more than one expression cassette ec2 and/or optionally one or more 
than one expression cassette ec3 or at least one expression cassette of two or more different types. The 
expression cassettes need only differ in the nature of the encoded protein to be expressed and in general 
15 comprise the following components: 

a) A regulon R which is responsive to methanol and/or depletion of catabolite repressing carbon sources. 
The term "regulon" as used throughout the application comprises any cis-acting DNA involved ih the 
regulation of expression of a given structural gene. The term thus embraces sequences preceding a 
given coding sequence, for example, a promoter and sequences which are recognised by transcription 
20 factors or other proteins involved in transcription regulation. The meaning of the term "regulon" is thus 
not limited to sequences corresponding to known promoters. Also included are DNA sequences 
exhibiting the same responsiveness to methanol as known regulons of methanol inducible genes, i.e. all 
functionally equivalent sequences. 

Recently several regulons exhibiting the above outlined properties became known. For example 

25 Ledeboer et ai. (1985) disclose the regulatory DNA sequence preceding the methanol oxidase (MOX) 
structural gene of Hansenula polymorpha . In EP 85 201 235.0 to Unilever NV, a system for the 
expression of proteins is disclosed, which involves the use of the promoters regulating the expression of 
methanol oxidase and DHAS in vivo . In EP 87 110 417 to Rhein Biotech GmbH, a system suitable for 
expression of foreign proteins under control of the formate dehydrogenase promoter (FMD) is proposed. 

30 Ellis et al. (1985) disclose the isolation of methanol regulatable genes from Pichia pastoris . These and 
other promoters obtained or obtainable from genes of methylotrophic yeast, which are potentially 
involved in methanol utilisation, all have in common that they respond to either addition of methanol 
and/or depletion of a carbon source with enhanced synthesis of the respective gene regulated by them. 
Also included are all DNA sequences which exhibit the same responsiveness. 

35 b) An open reading frame (ORF) coding for part(s) or all of the protein having the biological activity of 
one of the hepatitis B surface antigens. As already mentioned, hepatitis B surface antigens of given 
subtype comprise a set of about six different proteins, differing in length and glycosylation pattern. 
These proteins exhibit different antigenic determinants and also seem to have different functions for the 
viable virus. For eliciting a preS immune response it is not necessarily required to provide the respective 

40 complete preS-sequence, but it may be sufficient to include just those codons encoding the relevant 
epitope into the open reading frame. Furthermore a protein having the biological activity of one of the 
hepatitis B surface antigens also includes proteins having undergone exchanges of amino acids which do 
not interfere with their biological activity. 

c) Optionally a DNA sequence serving as a terminator T. As the terminator any sequence may be used, 
45 which efficiently terminates transcription in the respective host organism. Terminators may be for 
example sequences prone to give formation of hairpin structures and/or polyadenylation sites in 
transcribed RNA. In a preferred embodiment the terminator is derived from a methanol responsive gene 
of the same organism and/or from the same gene as the regulon was obtained. 
The three components are arranged in operable linkage in order to enable the regulon to regulate 
so transcription and the terminator to terminate transcription of the open reading frame between them. 

The microorganism according to the present invention has the invaluable advantage, that because of 
the presence of more than one copy of an expression cassette coding for a first polypeptide and/or one or 
more than one copy of each of the different additional expression cassettes, transcription and translation of 
the encoded protein(s) may occur in highly enhanced amounts. The microorganism according to the 
55 present invention provides the possibility to obtain any desired ratio of a first to a second polypeptide within 
the cell. Thus, upon assembly of protein to form particles composite particles of any composition may be 
obtained. Furthermore owing to the production of high amounts of different proteins in close proximity these 
proteins are more liable to form particles comprising more than one polypeptide species than particles 
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known in the art. 

The microorganism according to the present invention preferably expresses all or part of the S-protein 
from a first expression cassette (ec1) since this constitutes the structural scaffold of any particle which may 
be formed. In one embodiment the microorganism further contains a second expression cassette (ec2) 

5 which expresses a polypeptide representing all or part of the preS1 -protein, for example the preS1 specific 
domains localised at the N-terminai end of this protein or all or part of the preS2-protein. In a further 
embodiment the microorganism may contain three different types of expression cassette to enable 
simultaneous expression of three polypeptides comprising all or part of the S-protein, all or part of the 
preS1 -protein and all or part of the preS2-protein. 

10 Furthermore, the use of regulons derived from methylotropic yeast enables the repression, derepression 
or induction of a cell culture by addition or depletion of a carbon source or by addition of methanol which is 
cheaply available. 

The group of methylotrophic yeasts comprises members of the following genera: Candida , Kloeckera , 
Saccharomyces , Rhodotorula , Torulopsis and Pichia , preferably Hansenula . In the most preferred 
75 embodiment a microorganism according to the present invention is derived from Candida boidinii , Pichia 
pastoris or Hansenula polymorpha . Yeasts from these genera are well known to persons skilled in the art 
and their general culturing and growth conditions are established. 

A useful strain of the preferred methylotrophic yeast Hansenula may be any strain of the species 
Hansenula polymorpha . In the most preferred embodiment the strain to be used has the identifying 
20 characteristics of Hansenula polymorpha RB 10(DSM 5215). RB 10 is a ura3 mutant of Hansenula 
polymorpha ATCC 34438. 

In a preferred embodiment the microorganism according to the present invention is capable of efficient 
utilisation of glycerol as a carbon source. Glycerol is a non-catabolite repressing carbon source which is 
converted into energy with different efficiency by known methylotrophic yeasts. Yeast species of the genus 

25 Hansenula are able to metabolise glycerol with high efficiency, whereas S . cerevisiae grows poorly on this 
carbon source. In addition, growth of Hansenula polymorpha on glycerol results in very little ethanol by- 
production which otherwise causes problems during large-scale fermentation. 

The regulons used for the expression cassettes of the present invention are preferably derived from 
yeast of the well known genera mentioned above. Regulons which have been characterised in detail are 

30 preferred, such as the regulating region from various methanol responsive genes of Hansenula polymorpha 
, for example the methanol oxidase gene, the FMD gene, DAS gene or catalase gene. The regulating 
regions of most of these genes are already sequenced and the sequences published (e.g. Janowicz et al., 
1985; Ledeboer et al., 1985, EP 87 110 417). The technical proceedure of isolating these regulons from 
appropriate microorganism is facilitated by the knowledge of the respective DNA sequences and cor- 

35 responding restriction patterns. 

As a terminator any sequence may be included which efficiently serves to terminate the nascent mRNA 
regardless of whether it is derived from a eukaryotic or prokaryotic gene. Preferred terminators are derived 
from yeast genes. Use of termination sequences obtained from the methanol responsive genes cited above 
is most preferred. 

40 It is another advantage of the microorganism according to the present invention that a surprisingly high 
number of expression cassettes can be introduced into the respective host cell and be stably maintained. In 
general, there are two possibilities to achieve stable maintenance of specific genetic information: 

The information is either encoded chromosomally or it is localised on an autonomously replicating 
vector, which may be a plasmid, a cosmid. a virus or the like. Normally higher gene dosages are provided 

45 by transformation of the desired host organism using high copy number autonomously replicating plasmids 
or lysogenic viruses. These plasmids or viruses are well known in the art and usually contain a replicon and 
a selective marker gene enabling replication and stable maintenance of the vector. However, viruses are 
often subject to conversion to lytic cycles and plasmids are prone to being lost under certain conditions and 
furthermore often require continuous selection pressure. Moreover even with high copy number plasmids a 

50 copy number of not more than about 50 to 75 can generally be obtained in yeast. On the other hand, 
introduction of desired genetic information, e.g. an expression cassette, Into the host genome normally 
requires a certain degree of homology between at least part of the DNA sequence to be inserted, preferably 
the termini of said molecule, and the site of insertion. Thus the number of potential integration sites for 
homologous recombination of a given molecule is limited, in most cases to just one. This number is slightly 

65 more elevated for homologous recombination with genes present in multiple copies or with transposon-like 
structures such as Ty (Jacobs et al., 1988). In order to promote the integrative recombination event, it is 
preferable to use vectors lacking a replicon but carrying a selective marker gene, which confers a 
detectable phenotypic trait to the transformed cell and allows identification under selective conditions. 
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Random recombination is normally a rarely observed event. However according to the present 
invention, it is possible to introduce numerous copies of any given DNA sequence into one or more 
chromosomes of the genome, by Insertion due to random recombination. To obtain transformants carrying 
as many as some hundred copies of the DNA sequence desired, it is preferable to use autonomously 
5 replicating plasmids for transformation, which by the simultaneous events of integration of some copies and 
- replication of others can provide a large pool of integratable copies, which are inserted during subsequent 
ceil cycles. Surprisingly it was possible to obtain methylotrophic yeast strains carrying many expression 
cassettes. 

Preferred yeast strains, such as yeast strains of the genus Hansenula , contain about 2 - 500, preferably 

70 15 to 200, especially 20 to 150 copies of a desired DNA sequence. Once the desired number of integrated 
expression cassettes is obtained further integration processes can be prevented by cycles of growth on 
non-selective medium to segregate out the remaining plasmid copies. 

A special feature of microorganisms according to the present invention is the phenomenon of 
multimeric integration. Upon transformation with a vector capable of autonomous replication Hansenula 

15 polymorpha was found to integrate several repeats of the plasmid used for transformation. Repetitions of up 
to about 100 copies of the introduced plasmid were observed. This allows the presence and integration of a 
high number of copies of foreign DNA sequences without jeopardising the cell's viability by insertion into 
essential genes. Surprisingly integrants containing multimeric DNA were shown to be stable. 

For the purpose of designing vaccines containing composite particles a constant ratio of the different 

20 polypeptides of said particles is desirable. In order to meet this need the present invention provides 
microorganisms having a selected ratio of different expression cassettes. For instance, a microorganism 
according to the present invention may carry between 1 to 100 copies of an expression cassette (ec1) 
encoding ali or part of a protein having the biological activity of S-protein and 1 copy or several copies of 
an expression cassette (ec2) encoding a protein having the biological activity of preS2- or preS1 -protein. 

25 Preferred microorganisms carry a ratio of between 2 copies of ec1 to one copy of ec2 to 30 copies of 
ed to one copy of ec2 more especially in the ratio of 5 to 10 copies of ed to one copy of ec2 or 6-8 
copies of ed to one copy of ec2. A third expression cassette can be introduced in the same ratio as ec2 or 
6-8 copies of ed to one copy of ec3. 

In addition to the possibility of preparing strains containing integrated copies of the expression cassette, 

30 it is also possible to use microorganisms containing autonomously replicating vectors comprising autono- 
mously replicating sequences (ARS) and the desired expression cassette. Presence and maintenance of 
vectors containing ARS are well known in the art. 

Subjects of the present invention are also DNA molecules containing an expression cassette (ed) 
coding for a first polypeptide and an expression cassette (ec2) coding for a second polypeptide and/or an 

35 expression cassette (ec3) coding for a third polypeptide, wherein the expression cassettes comprise: 

a) a regulon R which is responsive to methanol and/or depletion of catabolite-repressing carbon sources; 

b) an open reading frame coding for part (or parts) or ail of a protein having the biological activity of one 
of the hepatitis B surface antigens; 

c) optionally a DNA sequence serving as a terminator T; wherein R exerts control on the transcription of 
40 the open reading frame and T directs polyadenylation and/or termination of transcription of the mRNA 

produced. 

For the properties of the regulon, the open reading frame and the terminator referal is made to the 
above provided information. Also for these molecules it is preferable to derive both regulon and terminator 
from a methylotrophic yeast selected from the genera Candida , Kloeckera , Saccharomyces , Rhodotorula , 
45 Torulopsis , Pichia and Hansenula . In the most preferred embodiment the regulon and the terminator are 
derived from a gene involved in methanol utilisation, for example the MOX-gene, the FMD gene, the DAS 
gene or the catalase gene. Hansenula polymorpha is a preferred microorganism as a source for these 
genes. For the identity of the polypeptides encoded by ed, ec2 and ec3, referral is made to the 
explanation given above. 

so The DNA molecule according to the present invention may be composed of fragments obtained from 
the respective natural sources. The DNA coding for any of the hepatitis B surface antigens can be isolated 
from DNA of intact viruses, also called Dane particles. Isolation, cloning and sequencing of HBV virion DNA 
are disclosed in Charnay et al. (1979), Galibert et al. (1979), Sninsky et al. (1979) Vaienzuela et ai. (1979). 
The choice of restriction enzymes for cloning and DNA manipulation depends on the kind of subtype of 

55 hepatitis virus used. The subtypes differ not only by length, but also with respect to their individual 
restriction patterns which are subject to normal mutation events. It is possible for any person skilled in the 
art to identify the restriction fragment carrying the desired coding region, for example by cleavage of the 
isolated DNA using different sets of restriction endonucieases, subjecting the samples to gel electophoresis 
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and Southern blotting or equivalent methods and identifying the restriction fragment using a probe 
corresponding to a known surface gene sequence. If necessary, the DNA may be sequenced and further 
manipulated, for example by treatment with endonucleases, in vitro mutagenesis, primer repair or other 
known techniques in order to create a fragment having the desired size, carrying the desired sequence and 
5 having the appropriate termini. As is known to any one skilled in the art, the termini may be trimmed to 
have any desired sequence by digestion with exonucleases, filling in with the Klenow fragment of DNA- 
polymerase I and addition or substitution of DNA by the use of synthetic oligodeoxyribonucleotides as 
linkers and adapters. 

The DNA sequence coding for the respective hepatitis surface antigen may then be further combined 
jo with an appropriate regulon and/or terminator, which are obtained by known techniques from the genome of 

a methylotrophic yeast. The respective fragments are adapted accordingly, in order to be in a suitable 

position relative to the translation initiation codon and the translation termination codon of the coding DNA 

sequence. Ligation is also carried out according to known procedures. Since these methods are well known 

in the art, no further explanations are provided. For a detailed description of techniques used in molecular 
75 cloning, "Molecular Cloning" by Maniatls et al. (1982) may be consulted. 

Instead of ligating DNA sequences isolated exclusively from the respective natural sources, it is also 

possible to synthesise part or all of the desired expression cassette by means of chemical synthesis. 

Chemical synthesis may be conducted according to known procedures. Chemical synthesis of long 

oligodeoxyribonucleotides is already offered on a commercial basis. Furthermore, any combination of 
20 fragments synthesised chemically and fragments obtained from the respective natural source may be 

appropriate for the purposes of the present invention. 

A further subject of the present invention is the provision of the URA3 gene obtained from a 

methylotrophic yeast. The gene was identified by complementation of the corresponding pyr F mutation of 

E. coli strain MB 1000 with plasmids constructed by random insertion of Hansenula polymorpha restriction 
25 fragments into a suitable vector. A pfasmid was identified which complemented the deficiency of the 

recipient strain. The newly identified gene comprises approximately 1400 bp by measurement on agarose 

gels. Its restriction pattern is shown in Figure 10 below. 

In order to facilitate the construction the amplification and introduction of the expression cassette into 

the host microorganism it is advisable to insert this onto a vector suitable for transformation of a 
30 methylotrophic yeast. This vector may either be a linear or circular plasmid or a virus. It is possible to use 

so called integrative vectors, i.e. vectors lacking any autonomously replicating sequence (ARS). If these 

vectors are not integrated they will be lost during subsequent cell cycles. 

In one preferred embodiment the vector also comprises such ARS DNA sequences capable of directing 

autonomous replication. Preferably these sequences are derived from the respective host microorganism. In 
35 the case of methylotrophic yeast there are several ARS described, for example HARS, and PARS 

(Roggenkamp et al., 1986; Cregg et al., 1985). These sequences contain a region of DNA which functions to 

stably maintain the vector as an extrachromosomai entity within the host cell. 

In order to facilitate the identification of transformed host organisms it is further preferred to provide 

selective marker-genes suitable for selection of the desired microorganism following transformation. In yeast 
40 plasmids known in the art these selective marker-genes often correspond the DNA sequences coding for 

enzymes involved in the amino acid or nucleic acid metabolism, which are defective in the recipient host 

organism to be transformed. 

The same strategy may be applied for selection of transformed methylotropic yeasts. Genes, which are 

commonly used for selection of auxotrophic Saccharomyces cerevisiae strains include the LEU2, HIS3, 
45 TRP5, ARG4 and URA3 genes. Genes obtained from Saccharomyces cerevisiae may also be used as 

selective markers in transformation of methylotrophic yeast; however, growth of such transformants on 

minimal medium could be considerably impaired. Therefore, in a preferred embodiment of the present 

invention, methylotrophic yeast strains deficient in the function of any one gene involved in normal 

metabolism are transformed with a DNA molecule carrying a wild type homologous marker gene. The 
50 present invention provides for the first time a homologous marker gene for selection for transformants in 

Hansenula polymorpha , namely the URA3 gene. The characteristics of this gene, i.e. its restriction 

fragment pattern, is discussed above. 

As an alternative way of selecting transformants use can be made of antibiotic resistance genes. This is 

a well-known approach for bacterial transformation and has been shown to be applicable to S. cerevisiae 
55 (Jimenez and Davies, 1980). 

However, for methylotrophic yeasts so far nothing is known regarding their behaviour upon contact with 

antibiotics such as chloramphenicol or gentamycin G418. This is believed to be the first report of selection 

of transformed methylotrophic yeast by subjecting the same to selective concentrations of growth inhibiting 
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aminoglycoside antibiotics following the transformation procedure. Surprisingly, the aminoglycoside 
phosphotransferase derived from Tn5 (Grindley, 1980) could be shown to be active also in methylotrophic 
yeasts when regulated by a promoter obtained from bakers yeast. Inactivation of G418 is effective also in 
strains where only one or two copies of the gene are present in the cell, i.e. the gene is not required in a 
5 high copy number. According to the present invention this is a very efficient transformation/selection system 
in methylotrophic yeasts. 

In the most preferred embodiment the vector according to the present invention is selected from the 
vectors pMS-2, pFS-9. pRB-S, pRB-S-269. pRB-S-271, pRB-S-322 f pRB-S-326, pMPS22, pMPS21, pMPS9. 
The construction and use of these vectors will be described below in more detail. 

ro A further subject of the present invention is a process for the preparation of a microorganism as 
described above, where a methylotrophic yeast host organism is transformed with a vector carrying the 
expression cassettes ed and/or ec2 and/or ec3 and wherein upon replication of the vector at least some of 
the expression cassettes are integrated by recombination with the chromosome of the yeast host organism, 
preferably as a multimeric repeat As a consequence, the transformed methylotrophic organism may contain 

15 several copies of any introduced expression cassette dispersed over the whole genome, but also with a 
high probability that multimeric repeats of the expression cassettes are localised in one site of the genome. 
This type of recombination is found to occur especially in Hansenula . However, the methylotrophic yeast 
host organism may also be selected from any other species of methylotrophic yeast genera, for example, 
Candida , Kloeckera , Saccharomyces , Rhodotorula , Torulopsis and Pichia • 

20 In the most preferred embodiment the host organism is selected from the species Hansenula 
polymorpha , exemplified by a strain having the identifying characteristics of Hansenula poiymorpha RB 10. 
Hansenula "polymorpha RB 10 was deposited with the "Deutsche Sammlung von Mikroorganismen und 
Zellkulturen" on February 17, 1989 and received the accession No. DSM5215. H . polymorpha DSM 5215 
forms a further aspect of this invention. ~~ 

25 While it is preferred to perform the transformation using any one of the above-described vectors 
according to the present invention, it will be appreciated that other vectors may be constructed and used. 

In order to obtain microorganisms containing more than one copy of an expression cassette (ed) and 
optionally one or more than one copy of a second expression cassette (ec2), the following process 
comprising several steps is the most convenient way. According to the process, the methylotrophic host 

30 organism a) is transformed with a first vector and initially grown on selection medium, followed by 
incubation under non-selective conditions for several generations, b) mitotically stable transformants 
containing more than one expression casssette (ed) coding for a first polypeptide are selected, c) said 
transformants are optionally cured of residual autonomously replicating plasmids, d) said transformants are 
optionally transformed with a second vector and again initially grown on selective medium, followed by 

35 incubation under non-selective conditions for several generations, e) mitotically stable transformants 
containing more than one expression cassette (ed) coding for a first polypeptide and optionally one or 
more than one expression cassette (ec2) coding for a second polypeptide are selected, f) said transfor- 
mants are optionally cured of residual autonomously replicating plasmids, steps d) to f) are optionally 
repeated using a third vector. 

40 According to this process, it is possible to increase the number of expression cassettes of one type, for 
instance, ec1, by subjecting the cells to several subsequent cycles of transformation, growth under 
selective/non-selective conditions, selection of transformants and optionally curing of free plasmids. Alter- 
natively, two or more types of expression cassettes can be introduced by either using a single type of 
vector containing two or more different expression cassettes or subjecting the cells to the above described 

45 cycles, wherein transformation is performed using different vectors, carrying different expression cassettes. 
The process according to the present invention thus provides a variety of possibilities in order to obtain a 
desired microorganism. 

A subject of the present invention is also a composite particle, comprising at least two polypeptides 
corresponding to part or all of a protein having the biological activity of one of the hepatitis B surface 

so antigens, wherein the particles present at least two antigenic determinants provided by the S-protein, preS2- 
protein or preSl -protein, said particle optionally containing host specific lipids. In general, the particles are 
made of a majority of a protein having the antigenic characteristics of the S-protein, which also constitutes 
about 80 to 90% of naturally occurring 20 nm particles. The composite particles further may comprise 
either any protein having the antigenicity of preSl -protein specific domains, i.e. peptides corresponding to 

55 at least part of the N-terminal 108 (or 119) amino acids of preSl -protein, or/and any protein having the 
antigenicity of preS2 specific domains, i.e. peptides corresponding to at least part of the 55 amino acid 
preS2-specific domain. 

The presence of two or more different proteins having the biological activity of hepatitis B surface 
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antigens in one particle was not a known approach for the design of yeast-derived anti-hepatitis B vaccine 
at the relevant priority date of the present application. Yeast-derived particles or aggregates obtained by 
that date had contained exclusively either the S protein, or the preS2-protein or the preS1 -protein, i.e. only 
one of the three possible proteins. It had been shown by Valenzuela et al. (1985) and Itoh et al. (1986) that 

5 preS2-protein was assembled into particles, while preS1 -protein had been reported to form either a mixture 
of small particles (2 or 3 nm in diameter) or a polydisperse population of large aggregates some 15 to 50 
nm in diameter (Kniskem et al., 1988; Imamura et al., 1987). Before the filing date of the present 
application, however, a publication appeared (EP-A- 0 341 746) which reports a process for the enhanced 
production of antigenic particles consisting essentially of hepatitis B S protein and preS2 protein. 

70 Rutgers et al. (1988) report that on preS2-protein and preS1 -protein particles or aggregates, the S- 
protein encoded epitopes are masked as these particles are less reactive in the AUSRIA assay. This may 
have a deleterious effect on the immune response with respect to S-protein when particles composed of 
homogenous preS protein species are used as immunogens for vaccine purposes. 

The present invention as claimed provides new composite particles comprising more than one species 

75 of hepatitis B surface antigen. By analogy with particles comprising only one protein produced in S. 
cerevisiae ., the particles according to the present invention will normally comprise host-specific lipids. 

In a preferred embodiment the composite particles according to the present invention comprise more 
than 1% preS1 -protein, the rest of the protein being S-protein and optionally some preS2-protein. Preferred 
particles, however, contain 1-50%, especially 1-25% preS1 -protein, the rest being the same as above. In 

20 especially preferred embodiments, the ratio of preS1 -protein to S-protein is between 1:1 and 1:100, 
preferably 1:2. 1:8 or 1:15. 

The particles according to the present invention are at least partially glycosylated. A proportion of the 
preS1 -proteins in methylotrophic yeasts are glycosylated, as indicated by the detection of bands with 
molecular weights of 45 kD, 38 kD and 39 kD. It will be appreciated by one skilled in the art that the 

2$ number and size of preS1 protein species observed by immunoblotting may present some apparent 
variability. Detection of the 45, 38 and 39 kD bands will depend, inter alia , on their resolution during 
polyacrylamide gel electrophoresis, interference from other yeast proteins and variations in the efficacy of 
the procedures used in immunoblotting. Further, the method chosen for the preparation of crude cell 
extracts and the specific composition of the buffer used for extraction may also influence the detection of a 

30 particular band. The relative proportions of the three species may also vary with growth conditions. 
Furthermore, estimates of molecular weight are also subject to experimental variation and depend on the 
standards used for calibration. Glycosylate of S-protein has not been observed. 

The composite particles described above are conveniently prepared by cultivating a methylotrophic 
yeast strain according to the present invention in a suitable culture medium. Using a regulated system of 

35 expression, the synthesis of particle components can be controlled to avoid their expression affecting cell 
growth. Thus at any desired cell density, methanol regulon-controlled protein synthesis may be derepres- 
sed to permit expression of the heterologous protein. Upon coordinated expression of different types of 
hepatitis B surface antigens in close proximity, particle formation occurs. These particles can be isolated 
from the culture by means of well-known techniques, for example isopycnic centrifugation, chromatography 

40 and other methods known in the art. 

The same method may also be applied in order to obtain particles or aggregates comprising only one 
species of surface antigen, when appropriate strains are used. 

In a preferred embodiment, protein synthesis directed by the expression cassette starts upon depletion 
of repressing carbon sources. The cells, which were previously subject to catabolite repression, begin 

45 expression upon dilution of the repressing agent. The effect of derepression can be further enhanced by 
addition of methanol, which results in induction of a methanol responsive regulon. Addition of methanol 
results in up to 10 fold enhancement of heterologous protein synthesis. 

In a preferred embodiment, the culture medium contains glycerol as a carbon source. Glycerol is a 
carbon source having a minimal catabolite repressing effect. Glycerol concentrations of up to 0.3% do not 

50 effect repression. Therefore, when using a culture medium containing glycerol, this carbon source does not 
have to be removed completely before the optional addition of methanol. 

Fermentation of the methylotrophic yeast strains according to the present invention is carried out 
between 30 to 40°C using a medium maintained at pH 3 to 7, preferably pH 4 to 6. Examples of media are 
given below. 

55 In a preferred embodiment of the invention the composite particle comprises all or part or parts of the 
S-protein and all or part or parts of the preS1-S2-S protein (L protein) of HBV. 

In an especially preferred embodiment of this invention the composite particle may comprise any of the 
modified L proteins disclosed both generically and specifically hereinbelow. 
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In a particularly preferred embodiment the modified L protein corresponds to a 280 amino acid 
polypeptide containing residues 12-52, 133-145 and 175-400 relative to the ORF on a HB virus of ad 
serotype. 

In yet a further preferred embodiment of this invention the composite particle may contain hepatitis. B 
5 surface antigen polypeptides of differing subtypes to broaden the range of protection. That is the S-protein 
may be of ad subtype specificity and the preS1 -protein and/or the modified L protein and/or the preS2- 
protein of ay subtype or vice versa . 

The subject of the present invention is further a composition of matter comprising the new composite 
particle or hepatitis B surface antigen protein produced according to the present invention. Further 
10 components of the inventive composition may be stabilizers, buffering substances, diluents, salts such as 
NaCI, or sugars. 

As outlined above, the initial purpose of the present invention is to provide a vaccine against hepatitis B 
virus. Therefore, a vaccine comprising a composite particle as described above or hepatitis B surface 
antigen produced according to the present invention is a further subject of the present application. The 

75 vaccine may comprise further substances, for example salts, stabilizers, adjuvants, and/or other physiologi- 
cally acceptable additives. The vaccine comprising composite particles elicits formation of antibodies 
directed against at least two of the possible antigenic determinants of hepatitis B surface antigens. Due to 
the presence of preS1 specific and preS2 specific domains, even individuals who did not respond to 
presently available vaccines, may be protected against hepatitis B infection. In general the vaccine 

20 according to the present invention will evoke a broader immune response than present vaccines and 
furthermore can be produced at lower costs due to the high amount of expression per cell and the high cell 
density. 

The present invention has also proved to be useful for the development of a test kit By providing a 
composition comprising the composite particles according to the present invention, it is possible to monitor 

25 the formation of immune complexes formed between antibodies of an infected individual with such particles. 
The test kit requires further provision of a detection agent, for example, a goat/anti-human IgG antibody 
labelled by any of the known techniques. The antibody, which can of course also be derived from any other 
mammal, may for example be coupled to an enzyme or may be radioactively labelled to effect detection. 
The advantage of a test kit responsive to antibodies against preS1 and preS2 specific domains is the 

30 possibility of early detection of HBV infection, since anti-preS1 and anti-preS2 antibodies appear prior to 
anti-S antibodies during HBV infection in man (Petit et al., 1986). Thus the test kit according to the invention 
provides a convenient means for early diagnosis of this highly infective disease. 

In a further aspect the present invention provides a modified large surface protein (hereinafter modified 
L protein) of hepatitis B virus (HBV). One such modified L protein has an L protein sequence characterized 

35 by the absence of sites preferentially attacked by proteases. Elimination of these sites provides a less 
protease sensitive molecule, which upon expression produces a protein readily purified in particle form. 

Another aspect of the present invention is a modified L protein characterized by the absence of an 
amino acid necessary for myristylation of the protein. 

In yet another aspect the present invention includes a modified L protein which is characterised by the 

40 absence of several potential glycosylation sites. Elimination of the glycosylation of the L protein improves 
the presentation of epitopes which are shielded, distorted or modified by glycosylation of the protein which 
occurs during expression in yeast hosts. The modified, deglycosylated L protein presents the protective B 
epitopes and significant T H epitopes of the preS1 , preS2 and S region in their -correct conformation, well 
exposed on the surface of the resulting particle. 

45 A further aspect of the present invention includes a modified L protein characterised by a deletion of a 
part of the preS2 region, which includes the human serum albumin (pHSA) binding site. This modification of 
the L protein thereby eliminates the potential tolerance or auto-immune problems linked to the presence of 
the pHSA receptor without losing the major B epitopes of the preS2 region. 

In still a further aspect of this invention, a modified L protein is provided which is characterized as non- 
50 glycosylated, and having diminished protease sensitivity and diminished pHSA binding capacity. This 
protein contains the preSl region sequences corresponding to amino acid residues 12 to 52 fused to preS2 
region sequences corresponding to amino acid residues 133-145, followed by the entire coding sequence 
for the S protein in a continuous reading frame. 

The modified L proteins of this invention may be expressed not only in methylotrophic yeasts as thus 

55 far discussed but also in other yeast expression systems which produce the proteins as particles retaining 
the major antigenic sites encoded by the three domains preSl , preS2 and S. The modified L proteins may 
also be co-expressed with the S protein in yeast to form particles of mixed subunit composition. Other 
expression systems, e.g., bacterial, insect, mammalian may also produce the modified L proteins in various 



16 



EP 0 414 374 A2 



forms. 

As another aspect this invention also provides novel DNA sequences coding for the modified L proteins 
of the present invention, in another aspect of this invention, such DNA sequences may be incorporated in 
vectors under the control of appropriate regulatory sequences. 

s As still another aspect there is provided a method for producing a modified L protein of HBV as 
described above involving culturing a yeast cell transformed with a yeast expression plasmid expressing a 
selected modified L protein gene sequence under control of suitable regulatory sequences and employing 
suitable culture conditions. The modified L protein may also be co-expressed in a yeast host with S protein. 
The resulting L protein may be isolated from the culture or cell lysate by conventional techniques. The 

70 modified L protein may be isolated in the form of particles of either mixed or homogeneous subunit 
composition depending on its co-expression with S protein. 

As still a further aspect, the invention provides a method for producing modified L proteins in other 
expression systems, including bacterial expression systems; insect cell expression systems, e.g., 
baculovirus systems; mammalian cell expression systems; vaccinia virus systems and plant expression 

75 systems. 

Stiil a further aspect of this invention are vaccines comprising one or more modified L proteins of this 
invention, alone or in combination with other vaccinal agents and pharmaceutical^ acceptable vaccinal 
carriers and adjuvants. The vaccines of this invention comprising the modified L proteins described herein 
may also include the modified L protein in the form of a particle of mixed subunit composition with other 

20 peptides i.e. S protein. 

From the foregoing it will be appreciated that the novel modified large (L) hepatitis B surface protein 
molecules may be used alone or with other HBsAg or HBV components to provide vaccinal agents for use 
in immunizing persons against HBV infection. The invention also involves expressing a modified L protein in 
a suitable host cell under suitable regulatory control. 

25 The modified L protein of the present invention is characterized by deletion, insertion or modification of 
a variety of amino acid sequences in the preS region. It is to be understood that the L molecule to be 
modified and expressed here has already been deleted for its first eleven amino acids and contains 163 
amino acids in the preS1 region (see co-pending U.S. patent application SN 009,325 incorporated by 
reference herein). The resulting reduction in size of the preS region facilitates particle formation in the host 

30 cell and enhances various biophysical properties of the molecule, which may make the modified L particle 
particularly desirable for vaccinal manufacture and use. For example, the shortened preS regions of the 
present invention may enable better presentation of the S and preS epitopes on the particle for interaction 
with the immune system, thereby enhancing immunogenicity. 

In one embodiment, the present invention provides a modified L protein deleted for the Gly 13 amino 

35 acid residue in the preS1 region. The L protein expressed in yeast carries a myristyl group covalently 
attached in an amide linkage, as disclosed in copending U.S. patent application SN 07/009,325. An N- 
terminal Gly residue is an essential requirement for the myristylation of a protein [Towler and Gordon, Ann. 
Rev. Biochem. , 57:69-99 (1988)]. Therefore, one suitable modified L protein of this invention is encoded by 
DNA having a deletion of the Gly13 codon (GGG) from the nucleotide sequence encoding the L protein. 

40 This deletion results in the synthesis of a non-myristylated L protein. 
As used herein the symbol A denotes deletion: 
Thus, for example. A Gly 13 modified L means modified L protein lacking amino acid residue 13 (glycine). 

Another embodiment of the present invention is a modified L protein deleted for one or more of its 
potential glycosylation sites. The L protein expressed in yeast carries an N-iinked oligosaccharide chain at 

45 Asn123 and several O-linked mannose chains linked to Ser and Thr residues in both the preS1 and preS2 
regions of the molecule as described in U.S. patent application SN 009,325, identified above [see, also, 
Rutgers et al, "Viral Hepatitis and Liver Disease" ed. A. J. Zuckerman, A. R. Liss, New York, pp. 304-308 
(1988)]. To remove the N-glycosylation site which is characterized by the sequence Asn-X-(Ser or Thr), 
where X may be any amino acid, the sequence at amino acid positions 123-125 is altered. The N-linked 

so glycosylation site may be eliminated by deleting or replacing the amino acids at positions 123 and 125, 
deleting amino acid 124 or deleting the entire 123-125 sequence. To destroy the O- linked glycosylation 
sites, ail or some of the Ser and Thr residues in the L sequence are deleted from, or replaced with, other 
amino acids in the protein. The L protein expressed from illustrative plasmid pRIT13192 is characterized as 
wholly or partially non-glycosylated. 

55 Another modified L protein of the invention contains a deletion in the preS2 sequence at amino acid 
residues 120-132 of the preS2 region responsible for binding of polymerized human serum albumin (pHSA) 
[see Machida et al, cited above]. pHSA binding is greatly diminished on the proteins lacking this sequence. 
Another modified L protein according to the invention is characterized by deletion or alteration of the 
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protease sensitive sites. Protease sensitive peptide bonds exist in the preS sequence behind the Arg 
residues around position 100 in the preS1 region [Heermann et al Intervirology 28 , 14-25 (1987)] and 
behind amino acid residues Arg137, Giy149 and Arg167. Deletiohsor other modifications introduced in the 
coding region of the L protein to remove some or ail of the codons for these residues produce an L protein 
5 (ess sensitive to protease action. 

The deletions introduced in the coding region of the L protein to form modified L proteins preferably 
leave untouched the codons for amino acids 21-47 and 133-145. These sequences are believed to provide 
the modified protein with the capacity to induce preS-specific antibodies correlated with immune protection 
[see. e.g., Itoh et al, supra]. 

w Construction of an exemplary yeast expression plasmid pRIT12979 encoding the A Qly13 L protein is 
described in Example F.1 below. Analysis of the modified L protein expression from plasmid pRIT12979 for 
expression level, post-translational modifications and macromolecular assembly is described in Example 
F.2. 

Another modified L protein, devoid of all three of the above mentioned properties, i.e. to function as a 

75 substrate to yeast giycosylation enzymes, pHSA binding capacity and protease sensitivity, and modified by 
removal of the codons for amino acid residues 53-132 and 146-174 of the preS region is illustrated in 
plasmid pRIT13192, described in Example F.3 below. Plasmid pRIT13192 contains the codons for residues 
12-52 and 133-145 of the preS region. Analysis of L protein expression from plasmid pRIT13192 is also 
described herein (Example F.4). The L protein expressed from pRIT13192 thus shows a greatly reduced 

20 pHSA binding capacity, an improved protease resistance and less giycosylation. 

Yet another exemplary modified L protein of the present invention combines the deletions introduced in 
the L protein coding sequences of pRIT12979 and pRIT13192. The L protein expressed by this sequence 
codes for a non-myristyiated, less glycosylated, less protease sensitive L protein with diminished pHSA 
binding capacity. Construction of the yeast expression plasmid pRIT13193 bearing this modified L 

25 sequence insert is also described in detail in Example F.3 below. 

Modified L protein coding sequences, including the entire S coding sequence, for use in the present 
invention can be prepared from these S and preS protein coding sequences of Table A by conventional site 
specific mutagenesis [see, eg., Botstein et al, Science , 229 :1193 (1985)] or recombinant DNA techniques, 
[see, e.g., T. Maniatis et al, Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory (1982)]. 

30 The amino acid position numbers referred to below will correspond with the position numbers in Table A for 
clarity. However, other published versions of the S, and preS sequences can be used in preparing the 
modified L proteins of this invention by one of skill in the art 

For example, the S protein can be isolated from DNA extracted from Dane particles in infected human 
serum by filling in the single strand region witb a DNA polymerase, preferably the endogenous polymerase, 

35 followed by digestion with a restriction endonuclease. The choice of endonuclease will depend, in part, on 
the particular Dane particles. For example, the HBsAg coding sequence of HBV DNA of certain Dane 
particles of the adw serotype can be isolated on a single BamHI fragment; the HBsAg coding sequence of 
HBV DNA of certain Dane particles of the ayw serotype can be isolated on a Hha I fragment. HBV DNA of 
Dane particles of the same serotype may alioexhibit different patterns of restriction sites. 

40 Construction of the modified L protein sequences of this invention can be accomplished by use of 
intermediate vectors. Techniques for cloning DNA fragments in phages are disclosed, for example, by 
Chamay et al, Nature , 286 :893-895 (1980) and Tiollais, United Kingdom Patent Application 2,034,323. 

The modified L protein of this invention may be constructed in a recombinant DNA molecule or vector. 
A recombinant vector of this invention comprises the modified L protein coding sequence of this invention 

45 operativety linked to a regulatory region. Such vector may also additionally contain a replicon depending 
upon preference and/or the host to be employed. By the term "replicon" is meant that minimum region of 
DNA which functions to maintain stably, and extrachromosomally, a recombinant vector in a eukaryotic or 
prokaryotic host organism transformed with such vector. Such replicons are well-known in the art. By the 
term "regulatory region" is meant any DNA sequence which contains a promoter region and other 

so sequences necessary for transcription of a coding sequence which regulates transcription of a structural 
gene's coding sequence. Such regulatory regions are well-known in the art. 

Such a vector can be prepared by conventional techniques such as by inserting the modified protein 
coding sequence of this invention into a vector already containing a replicon and regulatory region in such a 
way that the DNA molecule is operatively linked to such regulatory region. This DNA molecule, or an 

55 expression cassette containing the modified L protein coding sequence, can be constructed by conventional 
techniques, such as by ligating the L coding sequence to the regulatory sequence. 

Restriction of DNA to prepare DNA fragments used in the invention, ligation of such fragments to 
prepare recombinant DNA molecules used in the invention and insertion into microorganisms are carried out 
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by known techniques, such as techniques disclosed in the previously and subsequently cited references. 
Conditions are selected to avoid denaturation of the DNA and enzymes. Restriction enzymes and ligases 
used in carrying out this invention are commercially available and should be used in accordance with 
instructions included therewith. T4 DNA ligase is the preferred ligase. 

The various fragments and final constructions used for the expression of the modified L proteins, and 
the components of the vectors carrying these coding sequences for expression in host cells in the method 
of the present invention may be joined together using with conventional methods known to those of skill in 
the art. In many cases, genes have been isolated and restriction mapped, as well as sequenced. To that 
extent, one can select the sequence of interest, such as the HBV envelope protein coding sequence, by 
restriction of the gene, employing further manipulation as necessary such as resection with Bal31 , in vitro 
mutagenesis, primer repair, or the like, to provide a fragment of a desired size, including the desired 
sequence, and having the appropriate termini. Linkers and adapters can be used for joining sequences, as 
well as replacing lost sequences, where a restriction site employed was internal to the region of interest. 
The various fragments which are isolated, may be purified by electrophoresis, electroeluted, ligated to other 
sequences, cloned, reisolated and further manipulated. 

The use of regulatory regions for controlling transcription of the modified L gene of interest may allow 
for growing the host cells to high density with no or low levels of expression of the structural gene, and then 
inducing expression by changing the environmental conditions, such as nutrient, temperature, and the like. 

The vector carrying the L protein coding sequence is transformed into, and expressed in a selected 
host cell by conventional techniques. The cell is then cultured in appropriate culture media, i.e., media 
which enable the host to survive and express the modified L protein in recoverable quantity. For example, 
where a yeast host cell is selected, an exemplary medium for such use is Yeast Nitrogen Base minimal 
medium. The appropriate culture media may be selected by one of skill in the art depending upon the 
particular host employed in the method. 

The isolation of the modified L protein molecule of the invention may be from a cell lysate or extract of 
the host cell culture medium depending upon the cell's ability to secrete the modified protein. Recovery of 
the protein of this invention is performed by conventional protein isolation techniques. 

For producing modified L proteins of this invention, a suitable host cell or cell line may be selected. 
Suitable cells or cell lines may be yeast cells. Many strains of yeast cells known to those skilled in the art 
are available as host cells for expression of the modified L proteins of the present invention. In the practice 
of this invention, any species of yeast may be employed. In one aspect, the examples below employ 
Saccharomyces , specifically S. cerevisiae . Numerous strains of S. cerevisiae are available from public 
depositories and laboratories, elg., American Type Culture Collection, Rockville, MD. 

In another aspect Hansenula polymorpha may be used. 

Other yeast species may also be employed including, without limitation, Hansenula , Pichia , Candida , 
Kluyveromyces and Schizosaccharomyces . 

Additionally, other species of eukaryotic host cells, e.g., mammalian cells, such as Chinese hamster 
ovary cells (CHO) or 3T3 cells, may be employed as host cells for expression of the modified L proteins 
described herein. Selection of suitable mammalian host cells and methods for transformation, culture, 
amplification, screening and product production and purification are known in the art. See, e.g., Gething and 
Sambrook, Nature , 293 :620-625 (1981), or alternatively, Kaufman et al. MoL CelL Biol 5 (7) 1750-1759 
(1985) or Howley at aTD. S. Patent 4,419,446. Other suitable mammalian cell lines, are the monkey COS-1 
cell line, and the CV-1 cell line. 

It will be realised that some of the deletions introduced in the L protein coding sequence described 
above have altered the internal sequences engaged in the transcription and translation of M and S specific 
mRNA in mammalian cells. Such a modified L protein coding sequence allows, therefore, independent 
expression of S and modified L protein in a mammalian cell harbouring their individual expression 
cassettes. 

Similarly useful as host cells suitable for the present invention are bacterial cells. For example, the 
various strains of E. coli (e.g., HB101, MC1061 and strains used in the following examples) are well-known 
as host cells in thelieicTof biotechnology. Various strains of B. subtilis , Pseudomonas , other bacilli and the 
like may also be employed in this method. 

Additionally, where desired, insect cells may be utilized as host cells in the method of the present 
invention. See, e.g. Miller et al, Genetic Engineering , 8 :277-298 (Plenum Press 1986) and references cited 
therein. 

Some viruses may also be employed to express the L proteins of this invention, such as baculoviruses 
or vaccinia viruses. Selection of appropriate host cells is within the ability of one of skill in the art, as is the 
selection of appropriate culture conditions, media and vector components appropriate to expression in a 
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selected host cell. 

In the expression systems described above and below in the Examples, any known promoter capable of 
controlling the expression of the protein coding sequences in the selected host cell may be selected. 
Where the host ceil of choice is a yeast cell, the promoters useful for expressing the modified L protein 
s cassettes may include the strong promoter TDH3 or other constitutive or regulated promoters which allow 
modulation of the level of the L protein depending on the composition of the growth medium. Additionally, 
for example, PGK, ARG3, and APH1 are illustrative promoters useful in the vectors of the present invention. 

The present invention also provides for expression of the modified L proteins of this invention as part of 
an HBsAg particle of mixed polypeptide composition, as described in detail in copending U. S. Patent 
10 Application SN 07/368,401, which is incorporated by reference herein. The method involves coexpressing in 
a yeast cell a first protein S and a second protein X-S, where S is substantially the major surface protein of 
HBV and where X, for purposes of the present invention, is a modified preS sequence encoded by a 
functional DNA coding region fused in phase to the S coding sequence. 

Where reference is made to co-pending U.S. Patent Application No. 07/368,401 it should be noted that 
is substantially the same disclosure is to be found in Jacobs et al I, Gene 80 : 279-291 (1989). 

According to this method the S protein coding sequence is coexpressed with the modified L protein 
amino acid sequence. The coding regions of the L protein, as well as the sequence for the S protein may 
be constructed as described above using conventional chemical synthetic or recombinant DNA technology. 
The modified L proteins are incorporated together with S proteins into particles of mixed subunit 
20 composition. 

In the construction of the X-S proteins comprising modified L proteins of the present invention, S is 
preferably the entire mature surface protein (about 226 amino acids) of HBV (see Table A). Alternatively, the 
S protein may be encoded by an S protein coding sequence of the HBV genome, wherein S is substantially 
the same as an authentic HBsAg S protein. Truncates of S can also be employed in the construction of the 
25 modified L protein containing mixed particles, as long as particle assembly is not adversely affected. 

The S protein coding sequence is placed into a first expression cassette employing conventional 
techniques to allow expression of the S protein in the yeast host. The S protein coding sequence or a 
substantial portion thereof is employed in a fusion with portions of the preS coding sequence and placed 
into a second expression cassette to allow production of a modified L protein of this invention. 
30 A vector, constructed by conventional techniques as described above, carrying the S protein expression 
cassette and one or more vectors carrying the modified L protein expression cassette are transformed into 
into a yeast host cell by conventional techniques. 

Alternatively, S protein expression cassettes and expression cassettes for one or more modified L 
proteins are placed together on the same vector which is transformed into a yeast host cell. 
35 Vectors that can be used are Ty integrative vectors as described in detail in co-pending U.S. patent 
application SN 386,401 . Another alternative method employs compatible autonomously replicating vectors 
that carry the desired expression cassettes and transform the yeast host cell. 

Examples of vector systems are described in detail in Meilor et al., Yeast , 2 :145 (1986); Boeke et ai. t 
Cell, 40 ,491 (1985), Boeke et al., Cell , 42 :507 (1985) and copending UlTpatent applications SN 101,463; 
40 233,631 ; 368,401 and 009,325 (Eu7opean~Patent Application Publication No. 0 278 940) which applications 
and references are incorporated herein by reference. 

Additionally other species of host cells may be employed for expression of the composite HBsAg 
particles of mixed polypeptide composition described herein. Suitable eukaryotic hosts are the same as 
those listed hereinabove for expression of the modified L proteins of the invention, e.g. mammalian cells 
45 such as CHO cells and the like. Insect cells and viruses such as baculoviruses and vaccinia viruses 
mentioned hereinabove may also be used as host cells. 

A preferred embodiment of the method according to the present invention employs Ty integrative 
vectors carrying the S protein expression cassette and autonomously replicating plasmid vectors carrying 
the modified L protein expression cassette. The use of Ty vectors carrying the S protein expression 
so cassette allows these sequences to be stably integrated into the chromosomes of the yeast host cell. Such 
integrations are stable due to their low frequency of excision. The vectors are thus homogeneously 
distributed in the cell population, allowing a similar rate of protein synthesis in each cell. 

Transformation into these cells of autonomously replicating plasmids carrying the expression cassette 
for the modified L protein ensures that for a definite amount of S protein synthesis in each cell, the level of 
55 modified L protein synthesis varies according to the copy number of the plasmid present in the cell. Due to 
the variable S/L expression ratio in each cell, the particles derived from a total cell population will dispose 
the peptides in all possible associations. 

In a further preferred embodiment both the S protein expression cassette and the modified L protein 
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expression cassette may be inserted on Ty integrative vectors and separately integrated into yeast host 
cells of opposite mating types. Strains carrying the desired numbers of S cassette and modified L cassette 
integration events may then be crossed to obtain diploid cells expressing both the S and modified L 
polypeptides. If desired haploid segregants carrying both types of expression cassette may be obtained by 
sporulating such diploids. 

The cell may be cultured in appropriate culture media, i.e., media which enable the host to express the 
S protein and modified L protein in particles of mixed subunit composition in recoverable quantity. 

Resulting modified L proteins will be produced simultaneously in yeast cells together with the S protein 
and assembled as mixed composite particles. Such particles are herein defined as a multimeric assembly 
of two or more polypeptides into a particulate structure, which is therefore characterised by a composite 
nature being of mixed polypeptide composition. The different peptides are disposed in a spatial association 
at the surface of the mixed particles. The presence of the novel modified L protein on the HBsAg particle 
will elicit an immune response similar to that associated with recognition of the native determinant. The 
present invention thereby enables the production of HBsAg particles in yeast which mimic natural particles 
insofar as they contain the essential antigenic sites from the S, M and L proteins of HBV. 

The isolation of the mixed particle of the invention from a cell lysate or extract of the culture medium is 
performed by conventional protein isolation techniques. 

In the practice of this method of this invention, any species of yeast host for which transformation, 
cloning and expression systems are available may be employed in developing "mixed particles" of HBsAg. 

In other yeast genera no Ty sequences are present and therefore vector systems must be designed to 
provide coexpression of the S and modified L protein coding sequences in these genera to create the 
mixed particles described. Therefore, expression cassettes may be designed for integration into the yeast 
chromosomes employing other repetitive DNA sequences or piasmids such as ARS based plasmids in 
yeast such as Pichia , allowing integration into the chromosomes. An example of such applicable ARS 
plasmids is described in Cregg et al, Mof . CeU . Biol . t 5 (12) 3376 (1985). 

In the expression systems described above, e.g. both the Ty expression systems and others, the 
regulatory element comprises a promoter which effects RNA polymerase binding and transcript ion. 
Regulatable, i.e., inducible or derepressible, promoters, can also be used. A variety of useful promoters are 
available for expression of heterologous polypeptides in yeast. These include the copper inducible 
metallothionein gene (CUP1) promoter and the constitutive promoter of the glycolytic genes, 
glyceraldehyde-3-phosphate dehydrogenase (TDH3) and alcohol dehydrogenase (ADH). Regions for tran- 
scription termination in yeast are derived from the 3' end of any of several yeast genes, for example, the 
gene for iso-1 -cytochrome C (CYC1). Expression systems for use in Kluyveromyces are disclosed, e.g., in 
PCT WO83/04050, Hollenberg et al; in Schizosaccharomyces , e.g., in EP-A-1 07170, Yamomoto; in Pichia , 
e.g.. in Cregg et al, Bio/Technology , 5 :479 (1987); and in Hansenula . e.g., in EP-A-299108, Hollenberg et 
al. 

Specifically provided by the examples of this invention are S. cerevisiae strains capable of synthesizing 
S protein as a result of the genomic integration of S protein expression cassettes directed by Ty based 
vectors. One such vector, pRIT13133-L. containing the TDH3-S coding sequence - ARG3 cassette and the 
URA3 marker is described in Example 1 of co-pending U.S. patent application SN 368,401. Another Ty1 
vector, pRIT13034-L, contains the same expression cassette and URA3 marker but contains in addition the 
CUP1 gene which can be used as an additional marker in CUP1 S or cupl A recipient yeast strains. 

Preferred yeast strains are devoid of functional chromosomal CUP1 genes, which are sensitive to 
copper toxicity, in order to readily select transformants having integrated several copies of vectors. Two 
preferred cupl A yeast strains EJ cupl A 3d and EJ cupl A 7b are described in Example 9 of co-pending 
U.S. application SN 368,401 . 

Yeast strains containing several integrated copies of either pRIT13133-L or PrRIT1034-L are described 
in the Examples below. 

Transformation into these yeast strains of autonomously replicating yeast expression plasmids carrying 
an expression cassette for a modified L protein is also described in the Examples below. 

Diploid yeast strains carrying several integrated copies of both S and modified L expression cassettes 
are described in Example F.10 below. 

The method of the present invention for producing such modified L proteins alone or in particles of 
mixed subunit composition has advantageous utility in the production of vaccines to pathogenic microorgan- 
isms! For example, the modified L proteins described in the various vaccine formulations comprising 
hepatitis B surface antigen and may provide benefits not obtained with known HBV vaccines. The HBsAg 
particle of mixed subunit composition containing a modified L protein resembles particles or filaments or 
virions present in the blood of HBV infected persons that contain preS2 and preS1 epitopes exposed at 
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their surface in addition to epitopes from the major proteins and may also prove useful in novel vaccine 
formulations. The modified L proteins or mixed particles of mixed subunit composition containing them may 
have immunogenic properties that are not obtained with individual mixtures of S, M and L proteins. 

In further embodiment of this invention the particles of mixed subunit composition may contain modified 

s L proteins and S proteins with different subtype determinants so as to broaden the range of protection. That 
is the modified L protein may be of ay subtype specificity and the S protein of ad subtype or vice versa . 

Thus this invention encompasses vaccines containing the modified L proteins of the invention or a 
particle containing the modified L protein either in mixed (S, modified L) or homogeneous (modified L 
protein) subunit form. Such vaccine will contain an immunoprotective quantity of the particles or modified L 

10 proteins, i.e., enough of the proteins or particles are administered to elicit sufficient protective antibody 
response against HBV without serious side effects. Such vaccines may be prepared by conventional 
techniques. For example, a vaccine for stimulating protection against HBV infection in humans may contain 
the particle and/or the modified L proteins described above and a suitable conventional carrier. The particle 
or the L protein may be in an aqueous solution buffered at physiological pH for direct use. Alternatively, the 

75 protein or particle can be admixed or adsorbed with a conventional adjuvant, such as aluminum hydroxide 
or aluminum phosphate. Such a particle or modified L protein may also be combined with other 
immunogens to prepare combination vaccines. See, e.g., New Trends and Developments in Vaccines, eds. 
Voller et al. University Park Press, Baltimore. MD (1978). 

These modified L proteins, alone or in particles of mixed or homogeneous subunit composition, as 

20 described, thus broaden the spectrum of antibodies against HBV, and may provide an earlier response to 
viral challenge by the person vaccinated with the vaccines containing these proteins. Additionally the 
vaccines containing these modified L proteins may enable persons who do not respond to S proteins to 
mount a preS response, or may augment or enhance the S response. 

Another use for the particles of this invention and/or the modified L proteins may be as probes or 

25 reagents for detection of HBV or HBV derived proteins or anti-HBV antibodies in biological samples by 
various conventional immunoassays and the like. 

It will be appreciated that all vaccines described herein can be administered by an appropriate route, 
e.g., by the subcutaneous, intravenous or intramuscular routes. The amount of the immunogen in each 
vaccine dose is selected by the attending physician with regard to consideration of the patient's age, 

30 weight, sex, general physical condition and the like. The amount to induce an immunoprotective response in 
the patient without significant adverse side effects may vary depending upon the immunogen employed and 
the optional presence of an adjuvant. Generally, it is expected that each dose will comprise 1-1000 
micrograms of protein, preferably 1-200 micrograms. Initial doses may be optionally followed by repeated 
boosts, where desirable. 

35 Accordingly the present invention further provides a method of treating or preventing hepatitis B 

infection in humans which comprises giving to a patient in need thereof a protective amount of a vaccine 

according to the invention. 

Especially preferred vaccines according to the invention are those which comprise composite particles 

of structure (L*. S) where L* is as defined below and S denotes the S-protein of hepatitis B surface antigen, 
40 most preferably expressed in S . cerevisiae or H . polymorpha as described herein. 

As used herein the symbol L* denotes a modified L protein comprising residues 12-52, 133-145 and 

175-400 relative to the large open reading frame on HB virus of ad serotype shown in Table A hereinabove. 

The symbol A gly L* similarly denotes a modified L protein comprising residue 12 followed by residues 14- 

52, followed by residues 133-145 and 175-400. 
45 The following Examples illustrate the invention. 



Materials 

50 

1. Plasmids 

Plasmids mentioned in the specification are described in Part A below. 

55 

2. Yeast strains 

Hansenula polymorpha RB 10 ( ura 3) is a mutant of Hansenula polymorpha ATCC 34438, obtained by 
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mutagenesis using ethylmethansulfonate (EMS) essentially as described by Roggenkamp et al. (1986). 

3. Enzymes 

Restriction endonucleases, T4 DNA ligase and other enzymes used, were obtained either from 
Boehringer, Mannheim, FRG or Bethesda Research Laboratories, Eggenstein, FRG. The enzymes were 
used according to the instructions of the respective manufacturer. 

4. Immunological Reagents 

Monoclonal antibodies used included the following: 

a) S1.1: mouse monoclonal antibody specific for the S1 domain of preS1-protein (SmithKline Biologicais, 
Rixensart, Belgium) 

b) S2.1, S2.2 and S2.5; mouse monoclonal antibodies specific for the S2-domain of hepatitis B surface 
antigen (SmithKline Biologicais, Rixensart, Belgium). 

Antibodies specific for the S1 -domain and S2-domain of hepatitis B surface antigen are referred to as 
"preS-antibodies". 

c) RF6: Mouse monoclonal antibody specific for a denaturation and reduction resistant epitope in the S 
domain of hepatitis B surface antigen (H. Thomas, Royal Free Hospital, London). 

d) RF1: Mouse monoclonal antibody specific for a conformational dependant epitope located in the S- 
region (H. Thomas, Royal Free Hospital, London). 

e) HBS1 : Mouse monoclonal antibody directed aginst a denaturation and reduction resistant epitope of 
plasma-derived HBsAg, located in the S-region (SmithKline Biologicais Rixensart, Belgium), This mon- 
oclonal competes in binding experiments with Mab RF6. 

Antibodies a), b) and e) were prepared by the applicant according to standard procedures by fusion of 
mouse spleen cells with myeloma cells. For the method reference is made to Kohler and Milstein, 1975. 
Methods are disclosed in Goding, Monoclonal Antibodies, Principles and Practice, Academic Press, London 
1983. Other monoclonal antibodies mentioned hereinbelow were prepared similarly. 

f) AUSRIA (Abbott Laboratories, Wiesbaden-Delkenheim, FRG and Louvain-La-Neuve, Belgium): com- 
mercial kit containing polyclonal antibodies directed against native conformational epitopes. 

g) AUSZYME (Abbott Laboratories, Wiesbaden-Delkenheim, FRG and Louvain-La-Neuve, Belgium): 
commercial kit containing monoclonal antibodies directed against a native conformational epitope. 

The antibodies mentioned under f), g) are commercially available. They do not react with monomeric 
hepatitis B surface antigen. 

h) AUSAB (Abbott Laboratoires, Louvain-La-Neuve, Belgium): Commercial kit for detection of anti HBsAg- 
antibodies. 



5. Media 

Pepton 190, yeast extract, Casamino acids and yeast nitrogen base used for preparation of media were 
purchased from Gibco Ltd., Paisley, U.K. 

In order to prepare agar plates, 18g agar (Gibco Ltd.) per litre were added prior to autoclaving the 
respective medium. 
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a) SMR: Semi-rich non-selective medium 


Pepton 190 


2 g/l 


Yeast extract 


1 g/l 


Casamino acids (Pepton 5) 


3g/l 


Yeast nitrogen base W/O (NhUhSO* 


1.4 g/l 


(NhUfeSO* 


5 g/l 


Carbon source: glucose 


20 g/I 


or glycerol 


20 g/l 


or methanol 


10 g/l 


SMR containing methanol is also called Met-SMFL 



15 



b) YNB: Selective medium 


Yeast nitrogen base without (NHO2SO* (Gibco or Difco) 


1.4 g/l 


(NHO2SO4 


5 g/l 


Carbon sources: glucose 


20 g/l 


or glycerol 


20 g/l 


or methanol 


10 g/l 
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c) YEPD: 




Non-selective rich 


medium 




Yeast extract 


10 g/l 


Pepton 190 


20 g/l 


Glucose 


20 g/l 


If necessary, 1 mg 


gentamycin G418 was 


added per 1 ml of 


medium. 





40 



45 



50 
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d) S-medium 


COMPONENT 


AMOUNT PER 




LITRE 


(NHaUSOa 


5g 


KHoPOi 


4Q 


rVlyOW4./ n2w 


2 q 


INdwl 


0.1 q 




0.4 g 


1 .UliofiHino mnnnhvrlrnphlfiHHA 
LrrlloUUiHo mui luuyui uv^i mkji iuo 


10 mg 


LU~M©uiionin© 


9 ma 


lu- i ryptopnan 


Q mn 

0 11 ly 




2 ma 




0.16 mg 


l\l 


0.4 mg 


r6Ul3.on2U 


0 8 ma 


ft /lr% G f~\ . |_J_ (~\ 

MnoU4..ri2U 


1 1 @ mg 


IN 32 IV1OU4 .tn2 U 


0.8 mg 




1.6 mg 


Diotin 


0 16 ma 


Ca-pantothenate 


32 mg 


Folic acid 


0.16 mg 


Inositol 


160 mg 


Niacin 


0.4 mg 


p-Amino-benzoic acid 


0.2 mg 


Pyridoxine hydrochloride 


32 mg 


Riboflavin 


0.2 mg 


Thiamine hydrochloride 


32 mg 



II. Methods 



1. Handling of DNA 

Cleavage with restriction endonucleases, ligation using T4 DNA ligase, phosphorylation of DNA, 
isolation of DNA from E. coli , separation of DNA by gel electrophoresis, and all other techniques belonging 
to the field of gene technology used in these applications were essentially performed as described by 
Maniatis et al., 1982. 



2. Transformation of yeast 

Methylotrophic yeast is transformed by the methods described previously (Roggenkamp et al., 1986). 
The method using intact frozen cells is preferred (Klebe et al.. 1983). 

a) Transformed cells are identified by complementation of ura3 deficiency or by resistance to gen- 
tamycin. 

Resistance against gentamycin G418 was checked by plating the cells on YEPD plates containing 25 ml 
solidified medium. After 6 hours of growth on the plates 1 ml aqueous solution containing 25 mg 
gentamycin 418 was applied onto each plate and the incubation continued for 2 to 3 days. Gentamycin 
G418 (Trade Mark Geneticin) was obtained from Gibco std. 

b) Segregation of mitotically stable transformants was performed by selecting transformants from the 
selective transformation plate and allowing them to grow in a liquid selective medium (YNB or YEPD with 
G418 at 1 mg/ml) for about 20 generations (two passages). After this growth period cells were inoculated 
into a non-selective medium (SMR or YEPD) and allowed to grow for another 40-60 generations (5-6 
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passages). The cells were then plated on a selective plate (YNB or YEPD with G418 at 1 mg/ml) and 
clones still expressing the selective marker were retained. 



3. Isolation of yeast DNA 

Total DNA from yeast was prepared as described in Struhl et al. (1979). 



4. Analysis of yeast DNA 

The structure of the DNA in yeast transformants was analysed by digestion with appropriate restriction 
endonucleases followed by agarose gel electrophoresis. DNA fragments were then transferred to nitrocel- 
lulose and hybridized with the appropriate radioactively labelled DNA probe (Southern 1975). 



5. Immuno assays 



Western Blot Analysis 

a) Amersham procedure 

Proteins obtained from crude extracts or from gradient fractions were separated by SDS-polyacrylamide 
gel electrophoresis (Laemmli, 1970) using 12.5% separating, 5% stacking gels. The protein bands were 
transferred onto nitrocellulose essentially as described by Towbin et al. (1979). 

The antigenic material was detected by reacting the blot with specific antibodies. The antigen-antibody 
complex was visualised by reacting the filter with biotinylated antibody RPN 1001 (Amersham Buchler 
GmbH and Co. Ltd., Braunschweig, FRG) in a 1 :300 dilution followed by an incubation with peroxidase- 
streptavidin-complex RPN 1051 (Amersham) All steps of the experiment were performed according to 
the instructions of the manufacturer. 

b) Alternatively, the filter was preincubated with 0.05% Tween (w/v) in PBS. Following incubation with the 
given monoclonal antibody, antibody binding was detected by successive incubation with biotinylated 
sheep anti-mouse IgG (RPN 1021, Amersham) in a 1:250 dilution containing 0.05% Tween 20, 
streptavidin-biotinyiated horse radish peroxidase complex (RPN 1051, Amersham, dilution 1:400 in PBS 
containing 1% gelatin) and finally 30 ul H2O2 and 10 ml HRP Colour Development reagent containing 4- 
chloro-1-naphthol (3 mg/ml methanol) in 50 ml PBS. Between incubations the filter was washed with PBS 
containing 0.05% Tween 20. 

Quantification of produced monomeric antigens (S and preS) was achieved by comparison of samples 
of 0.01-0.3 ug of yeast-derived purified HBsAg (SmithKline Biologicals) to 3-5 different dilutions of crude 
protein extracts of Hansenula containing 2 to 20 ug total cell protein. 

The protein concentration was determined using a BioRad protein assay kit (BioRad Munich, FRG). 



EXAMPLES: PART A 



Plasmids 



Example A.1 



Construction of plasmids enabling expression of protein in Hansenula polymorpha 



a) Construction of a basic plasmid containing an Hansenula polymorpha autonomously replicating sequence 
(pME4). 
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The parental plasmid used to construct plasmids containing the S-gene under control of Hansenula 
promoters is pME4, which is already disclosed in the Ph.D thesis of M. Eckart, 1988. pME4 is a derivative 
of Ylp5 (Struhl et al.. 1979; Stinchcomb et a!. ( 1980) obtained by the following modifications: 

The Hansenula poiymorpha autonomously replicating sequence HARS1 is cloned into the Sail site of 
Ylp5 as disclosed in Roggenkamp et al. (1986) to result in pHARS. Plasmid pHARS was then reconstructed 
by deleting and reinserting the 440 base pair HARS1-Sall fragment into the Pvull site of the same vector. 
The Sail sticky ends of the fragment were made blunt ended prior to ligation by incubation with 
exonuclease VII followed by reaction with Klenow polymerase. This reconstruction resulted in a single Sail 
site in the tetracycline resistance gene of the pBR322 part of the plasmid. The intermediate plasmid thus 
obtained was again reconstructed in order to obtain a beta-lactamase gene lacking the original signal 
sequence wherein the signal sequence is replaced by the linker shown below; for details, see Eckart (1988): 

Sail ECORI BstEII 



GTC 


GAC 


GAA 


TTC 


AAA 


ATG 


TCC 


GGT 


CAC 


CAG 


CTG 


CTT 


AAG 


TTT 


TAC 


AGG 


CCA 


GTG 












Met 


Ser 


Gly 


His 



B-lactamase 



This step yielded plasmid pME4 as shown in figure 1 . 



b) MOX and FMD promoter fragments. 

The isolation of the MOX promoter from H. poiymorpha is disclosed in Eckart, 1988. For the purposes 
of the present invention a genomic EcoRI-Safilragment of 1525 bp containing the upstream control region 
of the MOX gene and the codons for the five first amino acids of the MOX protein was cloned into a 
pBR322 derivative. A schematic representation of the MOX-gene is given in Fig. 2. This intermediate 
plasmid was cleaved using EcoRI and treated with nuclease Bal31 and subsequently ligated with EcoRI 
linkers. Thus a series of promoter fragments having EcoRI 3' ends and Sail 5 ends were obtained by 
subsequent cleavage with restriction endonuclease Sail. The positions of the various deletion endpoints 
were established by DNA sequencing according to the method of Maxam and Gilbert, 1977. For the 
purpose of further experiments described in this application a plasmid designated pBR322-MP containing a 
deletion up to position minus 3 (when the first base of the MOX translation initiation codon is base plus 1) 
was used. The plasmid pBR322-MP is described in Eckart, 1988. 

The isolation and characterisation of the FMD promoter is disclosed in European Patent Application 87 
110 417.0. For the purposes of the present invention a 1.4 kb BamHI fragment encompassing about 1000 
bp of the upstream promoter region was cloned into the BamHI site of pUC19. A clone was selected, in 
which the insert was orientated in a way to have the single EcoRI-site of pUC19 at the 3 end of the 
inserted fragment. A schematic representation of the genomic clone containing the FMD gene including its 
promoter is shown in Figure 2B. A series of nuclease Bal31 deletions starting from the EcoRI site within 
then pUC19-linker was prepared in order to obtain plasmids containing the promoter without sequences of 
the structural gene. Following Bal31 treatment the DNA was ligated to EcoRI linkers, the DNA was cleaved 
with BarnHi and the BamHI-EcoRI fragments were cloned into pBR322. A variety of deletions were 
obtained. Deletions to positions minus 5 and minus 9 from the first ATG were most efficient in later 
experiments. For the further experiments described in this application a plasmid carrying a promoter 
fragment containing the deletion to position -9 from the first ATG was used (plasmid pBR322-FMD-P-9). A 
schematic representation of the MOX- and FMD promoter fragments used for construction of further 
plasmids is shown below. 
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i) MOX promoter fragment (-3) 



Sail EcoRl 
-15 -3 

( 1500 bp) AAACTACAAAAAC GGAATTCC 

Linker 



ii) FMD promoter fragment (-9) 



BamHI EcoRI 
-14 -9 

( 1000 bp) TTCATC GGAATTCC 

Linker 



C) Terminator 

The isolation of a transcription terminator from the MOX gene is disclosed in Eckart, 1988. A schematic 
representation of the 3 region of the MOX gene is shown below: 



Arg Phe Stop 



AGA TTC TAA 

Sail Asp718 EcoRV Nrul 

< 1477 bp >< 327bp > 

< 284 bp X 368bp > 

The Sall-Nrul fragment shown in the scheme above was subcloned into pBR322 between the Sail and 
Nrul sites and this plasmid was then cleaved using restriction endonuclease Asp718 resulting in a 
linearisation of the plasmid due to a single Asp718 site within the MOX open reading frame, and subjected 
to Bal31 deletions. 

By DNA sequence analysis of the resulting fragments a terminator fragment of about 320 bp was 
identified (Eckart, 1988) which still functions as terminator. This blunt ended terminator fragment 
(GACATACC~315bp-EcoRV) was ligated into the Smai site of pUC19. Orientation of the fragment was 
established by sequence analysis (Maxam and Gilbert, 1977). The resulting clone pT-24 is shown in Figure 
3. This clone was used to construct expression vectors as shown below. 

d) Selective marker genes 

pME4 (described above, see Fig. 1) was cleaved with EcoRI and the sticky ends were filled in with 
Klenow polymerase. Plasmid pT-24 was digested with EcoRI and Hindlll and the fragment comprising the 
pUC19 linker and the MOX terminator fragment isolated. The sticky ends of this fragment were made blunt 
ended by treatment with Klenow polymerase and the blunt ended fragment ligated with pME4. The resulting 
plasmid was called pP/T-24. The cloning sites derived from pUC19 were later on used for further cloning 
steps. 
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Plasmid pP/T-24 contains the S. cerevisiae URA3 gene as a selective marker. As a further selective 
marker a gene conferring resistance~against chloramphenicol was introduced. This gene was isolated from 
vector pBR325 as a 1.7 kb Aatil-Clal fragment, subjected to a short Bal31 digestion for removal of about 
four base pairs on each side, and made blunt-ended using Klenow fill-in reaction. Subsequently the 
fragment was ligated with pP/T-24, which had been cleaved with Nrul. The resulting plasmid showing 
chloramphenicol resistance was called pP/T-24-C. 



Example A.2 

Construction of a plasmid containing the Hepatitis B surface antigen gene in functional combination with 
the Hansenula polymorpha terminator. 

In order to obtain a functional unit comprising the Hepatitis B S-gene and a terminator efficient in 
Hansenula polymorpha the S-gene was introduced as an Ncol-EcoRI fragment of 683 bp into the BamHI 
site of the plasmid pP/T-24C provided by the pUC19 linker portion of said plasmid. The 683 bp fragment 
was obtained from pRIT!2331 which is a conventional pUC9 based E coli vector containing the 683 bp 
Ncol-EcoRI DNA fragment encoding the S-protein from Ncol at the ATG codon (CC ATG Q) to EcoRI 
beyond the TAA stop codon ( TAA CGAATTC). The surface antigen coding sequence was derived by 
conventional recombinant DNA techniques from pRIT10616 disclosed in EP-A-0278940 and in Harford et 
al., 1987. pRIT10616 contains the genome of a HBV virus of adw serotype cloned on pACYC184 and was 
deposited with the American Type Culture Collection under the terms of the Budapest Treaty on June 2, 
1982 under Accession Number ATCC39131. 

The sticky ends of the S-gene fragment and of the BamHI cleaved pP/T-24C were made blunt ended 
by filling in prior to ligation. Ligation of the fragment in the proper orientation regenerates adjacent BamHI 
and Ncol sites on the construction immediately 5 to the S-gene coding region. The resulting plasmid is 
called pRB-S. 



Example A.3 

Construction of plasmids containing a functional expression cassette comprising Hansenula polymorpha 
derived promoters, Hepatitis B surface antigen gene and Hansenula polymorpha terminator. 

The pBR322 derivatives pBR322-MP and pBR322-FMD-P-9 containing the MOX or FMD promoters, 
respectively, were digested with EcoRI and the sticky ends were filled in using Klenow polymerase. The 
plasmids were then cleaved with Sail. The resulting fragment containing the MOX promoter comprises 
exclusively Hansenula polymorpha genomic DNA, whereas the fragment comprising the FMD promoter 
contains in addition 275 bp of pBR322 sequences (position bp 375 to bp 650 of pRB322 map). 

Plasmid pRB-S was cut at the regenerated BamHI site, the site was made blunt ended by filling-in with 
Klenow polymerase and subsequently the plasmid was digested with Sail. The promoter fragments having a 
Sail sticky end and a blunt end were ligated with the large Sail-blunt end fragment of pRB-S containing the 
S-gene coding region and the Hansenula polymorpha terminator. The resulting plasmids are called pRB-S- 
269 (MOX promoter) and pRB-S-271 (FMD promoter, minus 9 deletion) and contain the S-gene under 
control of the Hansenula polymorpha promoter and terminator as shown in Figure 4. The sequence 
surrounding the fusion between promoters and S-gene is shown in the schematic representation below. 
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a) MOX - promoter fusion (pRB-S-269) 

-3 +1 

< AAAAAC GGAATTGATCC ATG 

> 

MOX promoter S-gene 

b) PMD - promoter (-9) fusion 

-9 +1 

< TTCCATC GGAATTGATCC ATG 

> 

FMD promoter S-gene 

Plasmids constructed identically to pRB-S-271 and pRB-5-269, but lacking the insertion of the 440 bp 
restriction fragment carrying the HARS1 sequences were called pRB-S-269l and pRB-S-2711. These 
plasmids were used to obtain integrants containing 1 to 3 copies of the expression cassette. 



Example A.4 



Construction of integrative vectors containing the S-gene. 



a) Construction of plasmid pBC containing Hansenula polymorpha autonomously replicating sequences and 
H. polymorpha URA3 gene. 



Plasmid pBC is derived from plasmid YRp7 (Tschumper and Carbon, 1980) containing the Sac- 
charomyces cerevisiae ARS1 sequence. This autonomously replicating sequence was inactivated by 
insertion of a 5400 bp Bglll-Sphf fragment encompassing the Hansenula polymorpha URA3 gene. Further- 
more the Hansenula autonomously replicating sequence 1 (HARS1) was cloned into the Sail site of the 
tetracycline resistance gene (Fig. 5). 



b) Insertion of expression cassettes into plasmid pBC. 
Plasmid pRB-S-269 was digested with BspMII and Sail. 

The resulting fragment bearing the expression cassette comprising the MOX promoter, the Hepatitis B 
surface antigen encoding region and terminator was made blunt ended and ligated to plasmid pBC following 
cleavage with Xhol and Stul and filling in of the Xhol sticky ends. The resulting plasmid pMS-2 contains the 
S-gene expression cassette surrounded by flanking sequences of the Hansenula polymorpha URA3 gene 
fragment. 

For integration of the S-gene expression cassette into the Hansenula polymorpha genome plasmid 
pMS2 was cleaved with Nhel and the resulting fragment of 6.1 kb was used to transform the cells. 

Vector pFS-9 was constructed in an analogous way except that the expression cassette was isolated 
from pRB-S-271 containing the FMD promoter. 

A detailed description of the resulting plasmids is provided by way of Figure 5. 



Example A.5 
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Construction of plasmids comprising the S-gene expression cassette and an additional selective marker 
gene. 

5 a) Construction of the basic plasmid pHK2 

For construction of plasmid pHK2 (Fig. 6; deposited under the terms of the Budapest Treaty with the 
DSM (DSM DEUTSCHE SAMMLUNG VON MIKROORGANISMEN UND ZELLKULTUREN GmbH) under 
Accession number 5328 on April 27, 1989) the kanamycin resistance gene was isolated from transposon 
io Tn5 (Grindley and Joyce, 1980). Unnecessary sequences were deleted by treatment of the structural gene 
with nuclease Bal31. The resulting fragment comprising the kanamycin structural gene including the 
kanamycin terminator was ligated with the Saccharomyces cervisiae ADH1 promoter (Hitzeman et al. 1981). 
Furthermore the HARS1 sequence was introduced as a 440 bp Sail fragment into the plasmid. 

15 

b) Construction of kanamycin resistant strains derived from pRB-S-269 and pRB-S-271. 

The Hindlll fragment from pHK encompassing the kanamycin resistance coding region (3.5 kb) was 
cloned into the Hindlll site present in the HARS1 -sequence of plasmids pRB-S-269 and pRB-S-271. The 
20 resulting plasmids are called pRB-S-322 and pRB-S-326 respectively and are shown in Figure 7. 



Example A.6 

25 

Construction of a parental plasmid for expression of pre-S proteins. 

Plasmid pP/T-24 (Example A.2) was reconstructed In order to obtain a plasmid pMOX - P/T1 (Fig. 8) 
carrying a universal expression cassette comprising the MOX promoter, a multiple cloning site and the 
30 MOX terminator. 

In order to insert a linker containing restriction sites for restriction endonucleases Sail, EcoRI, Bglll and 
BamHI the pUC19 linker of plasmid pPfT-24 was partly replaced by synthetic DNA containing restriction 
sites for the above mentioned enzymes. 

Plasmid pP/T-24 was cleaved with restriction endonucleases BamHI and Sail. These sites originate from 
35 the pUC19 linker (BamHI) and pME4 (Sail). The linker molecule containing restriction sites for Sail, EcoRI, 
Bglll and BamHI was inserted. The resulting plasmid pT1 was digested with EcoRI and Sail and the MOX 
promoter was inserted as a 1.5 kb EcoRI-Sall fragment yielding plasmid pT2. The relevant areas of plasmid 
pT2 are shown below: 

40 sail EcoRI Bglll BamHI 



-1500 -3 

45 

/ / AAAAAC GGAATTC AGATCT GGATCC AGCTT 



MOX promoter Terminator 

55 Furthermore plasmid pT2 contains the Hansenula polymorpha ARS and Saccharomyces cervisiae URA3 
gene as in plasmid PRB-269 (Fig. 3). 

The next construction step was the replacement of the Saccharomyces cervisiae URA3 gene by the 
URA3 gene derived from Hansenula polymorpha . A clone containing this gene was isolated as described 
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below in example A.9. The URA3 gene was isolated as a 1.2 kb BamHI-Bglil fragment. Whereas the Bglll 
site is an original genomic site, the BamHI site was the result of ligating a Sau3A site with a BamHI site 
used to construct the genomic library. The sticky ends of this BamHI-Bglll fragment were filed in by Klenow 
polymerase and inserted into the Aval site of pBR322, which also had been filled in with Klenow 
5 polymerase. The Aval site was regenerated by the ligation and the resulting plasmid was designated 
pURA3-P1. 

Plasmid URA3-P1 was digested with Nrul and BspMII. These sites originate from pBR322, positions bp 
974 and bp 1664 of the conventional pBR322 map. Plasmid pT2 was subjected to a complete digestion with 
Nrui (bp 974 of pBR322) and partial digestion with BspMII. 

10 There are 2 BspMII sites in pT2: The first in pBR322 (bp 1664), the second is located in the MOX 
terminator. The Nrul-BspMII fragment of pT2 (1800 bp) contains the URA3 gene of Saccharomyces 
cerevisiae . The Nrul-BspMII fragment from URA3-P1 carrying the Hansenula polymorpha URA3 gene was 
cloned into pT2 replacing the S . cerevisiae URA3 gene located on the corresponding Nrul-BspMII 
fragment. The resulting plasmid pT3 contains the MOX terminator, a linker, the MOX promoter and the 

75 Hansenula polymorpha URA3 gene following one another. Plasmid pT3 lacks however a functional j8- 
lactamase gene as in parental plasmid pME4. To replace this defective ^-lactamase gene by a functional 
gene pBR322 was digested with EcoRI, the sticky ends fiiled-in with Klenow polymerase and ligated to a 8 
bp Asp718 linker followed by digestion with Pvul (position 3738 of the pBR322 map). The 633 bp Asp718- 
Pvul fragment thus obtained bears the original promoter and translation initiation codon of the ^-lactamase 

20 gene. 

Plasmid pT3 was subjected to cleavage by Asp718 (the Asp718 is located at the end of the MOX 
terminator) and to a partial digestion with Pvul. The Asp718-Pvul fragment obtained as described above was 
then inserted into the pretreated pT3 yielding the plasmid pMOX-P/T1. 

Plasmid pMOX-P/T1 is shown in Figure 8. This plasmid contains a functional beta-lactamase gene. 



Example A.7 



30 Construction of plasmids expressing pre-S proteins. 

The pre-S gene encoding the preS1-S2-S-protein (large protein) of Hepatitis B virus was obtained as a 
Ncol-Hindill fragment from pRIT12816. pRIT12816 is a conventional pBR322 based E. coli vector containing 
a 1185 bp Ncol-Hindltl fragment encoding the preS1-S2-S-protein from Ncol at the ATG codon (CCATGG) 

35 to Hindlll at 15 bp beyond the TAA stop codon (....AAGCTT). The preS1-S2-S-coding sequence fragment 
was derived by conventional cloning techniques from pRIT10616 disclosed in EP-A-0278940 and in Harford 
et al ., 1987. A 861 bp fragment encoding the preS2-S sequence can also be recovered from this by 
conventional means. The Ncol-Hindlll fragment was made blunt ended by filling-in with Klenow polymerase 
and cloned into the also made blunt ended EcoRI site of pMOX-P/T1. As a consequence of correct ligation 

40 the EcoRI site in front of the preS1 structural gene is regenerated by Ncol/EcoRI blunt ended ligation. The 
construction expressing the pre S-gene under control of the MOX promoter is colled pMPS-22. This is 
shown in Figure 8. 

The analogous construction containing the FMD promoter is called pMPS-21. 

45 

Example A.8 



Construction of a vector containing a preS expression cassette. 

50 

The Sall-Asp718 expression cassette from plasmid pMPS22 was made blunt ended by filling in and 
inserted between the filled in Xhol site and Stul sites of plasmid pBC described in Example A4a above. The 
resulting plasmid is called pMPS-9 and shown in Figure 9. 

55 

Example A.9 
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Cloning of the Hansenula polymorpha URA3 gene. 

The URA3 gene of Hansenula polymorpha was cloned by complementation of the corresponding pyrF 
mutation of E. coii strain MB 1000. 

6 Genomic~DNA fragments ranging in size from 6 to 9 kbp were isolated via partial digestion with Sau3A, 
separation on a low melting agarose gel and isolation of the appropriate DNA fraction. 

The purified fragments were ligated into the unique BamHI site of vector YRp7, containing an 
autonomous replicating sequence of Hansenula polymorpha inserted into the Sail site. The resulting ligation 
mixture was used to transform E. coli strain MB 1000 and transformants were selected on minimal plates 

w without uracil. Transformants obtained were analysed and the piasmids were isolated. A sub-fragment as 
shown in Fig. 10 was isolated, which mediated the restoration of the URA* phenotype in E. coli , 
Saccharomyces cerevisiae and Hansenula polymorpha orotidine 5 -monophosphate decarboxylase mutants. 
The structure of the cloned DNA was shown to be identical to that of the genomic URA3 region by Southern 
blotting. A restriction map of the Hansenula polymorpha URA3 gene fragment is shown in Figure 10. 

75 

PartB 



20 

Expression of Hepatitis B surface Antigen (HBsAg) in Hansenula polymorpha 



Example B.1 

25 

Construction of Hansenula polymorpha strains containing the S-gene 

Hansenula polymorpha RB 10 was transformed with the above described piasmids pRB-s-269, pRB-S- 
30 269-1, pRB-S-271. pRB-S-271-l. Several clones bearing automonously replicating piasmids (transformants) 
or integrated foreign DNA (integrants) were obtained. 

These transformants/integrants were then tested for the expression of S-gene using immuno assays. 
Two test systems were used: Western blotting and AUSRIA or AUSZYME tests (Abbott). The production of 
HBsAg monomer was analysed by Western blotting using monoclonal RF6 antibodies. Abbott's AUSRIA or 
35 AUSZYME Kits were used according to the instructions of the manufacturer to estimate the amount of 
HBsAg particles produced in Hansenula . Transformants showing a stable and high expression level were 
selected and their DNA analysed by Southern blotting following electrophoretic separation. 



40 a) Transformants 

Several colonies bearing the freely replicating plasmid were obtained from Hansenula polymorpha 
transformed with piasmids pRB-S-269 and pRB-S-271. Southern analysis of the total DNA obtained from 
these transformants digested with different restriction endonucleases showed the expected restriction 
46 pattern. 



b) Integrants 

so Hansenula polymorpha can be transformed with the integrative vectors pRB-S-269-l and pRB-S-271-l at 
a high frequency. Integrants grown on complete medium (YEPD) are mitotically stable. Alternatively, 
mitotically stable maintenance of a foreign DNA in Hansenula can also be obtained by the procedure 
described above, wherein an autonomously replicating vector, preferably a high copy number plasmid such 
as pRB-S-269 or pRB-S-271 integrates spontaneously into the genome. The resulting integrants also show 

55 high mitotic stability of the foreign DNA. 

Both methods mentioned above were used to obtain strains efficiently expressing the S-gene in non- 
selective complete medium. 3 strains were retained; the strain having the identification number 352, which 
was transformed with pRB-S-271, wherein the S-gene is regulated by the FMD-promoter, and strains No. 
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415 and 416, which were transformed with pRB-S-269 containing a MOX regulated S-gene, were used for 
further experiments. 



5 Example B.2 



Mitotic stability of foreign DNA in integrants 

ro The stability of the recombinant strains 352, 415 and 416 was tested by the following procedure: Strains 

were grown In non-selective SMR medium for 40 generations. Twelve independent colonies were isolated 

from the cultures before and after the 40-generation period. 

The structure of the foreign DNA and the copy number of the expression cassette were examined in the 

isolated clones by Southern blot analyisis in combination with agarose gel electrophoresis of restriction 
T5 fragments and Pulsed Field Gradient Gel Electrophoresis of total chromosomes. The latter technique allows 

the separation and analysis of yeast chromosomes (Schwartz and Cantor, 1984). 



Pulsed Field Gel Electrophoresis 

20 

Cells from strains 415, 416 and 352 were subjected to gentle lysis in the sample wells of an 0.7% w/v 
agarose gel. The in situ liberated chromosomes were separated using an LKB-Pharmacia Pulsaphor Plus 
apparatus according lo"the instructions of LKB-Pharmacia. Following PFGE the DNA was transferred to 
nitrocellulose according to the procedure of Southern (1975). The blot was submitted to hybridisation using 

25 ^P-labelled probes specific either for the MOX and/or FMD gene or the URA3 gene of Hansenula 
polymorpha , the latter being known to be present as a single copy in the genome. 

Comparison of the signals derived from possible multimeric integrants with those of single copy genes 
{intrinsic markers) allowed an estimation of the copy number of the relevant expression cassettes present 
inside the cell. The analyisis of chromosomes by PFGE demonstates that clone 415 has about 30 to 50 

30 copies of the expression cassette, whereas clone 416 contains about 5-6 expression cassettes and clone 
352 about 10 expression cassettes. The analysis showed that the foreign DNA is integrated into the 
Hansenula genome. 



35 Southern Analysis 

Strains 352, 415 and 416 were grown for 40 generations in SMR medium. Twelve independent colonies 
were isolated from each batch before and after the above mentioned growth period. DNA preparations from 
these sub-clones were then subjected to Southern analysis after cleavage with various restriction enzymes 
40 and using ^P-labelled DNA-probes. Analysis of the restriction patterns showed that the expression 
cassettes were intact and that the introduced DNA was genetically stable. The analysis further revealed that 
the genome of strain 415 contains long repeats with several copies of the plasmid. The plasmid is deduced 
to be arranged in this repeat in a head to tail manner. 

45 

Example B.3 



Expression studies 

50 

Recombinant strains were routinely analysed for the presence of HBsAg by Western Blotting and 
AUSRIA/AUSZYME tests. 



55 a) Growth of Cultures and Western blotting 

Strains 352, 415 and 416 were grown in 11 flasks in 200 ml SMR-glycerol medium at 37°C with vigorous 
aeration. At the beginning of the stationary phase, which is characterised by the depletion of glycerol, 50 
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ml/l of a five fold concentrated SMR medium lacking glycerol was added together with methanol to a final 
concentration of 10 g/l. Samples of 10 ml volume were taken from the culture prior to addition of methanol 
and at 24 hours following addition of methanol. For the purpose of comparison the same strains were grown 
also in SMR medium containing 30 g/i glucose (repression of promoters). The cells were harvested in the 
s late log-phase. 

From the harvested cells crude protein extracts were prepared by suspending a cell pellet correspond- 
ing to about 0.1 g dry weight in 5 ml of PE buffer (0.5 M NaCl f 0.1% Triton X-100, 10 mM phosphate buffer 
pH 7.5, 2 mM PMSF). Approximately an equal volume of glass beads of 0.45 mm diameter was added until 
the mixture because almost solid. The mixture was subjected to vigorous shaking for 4 minutes at 4°C 

70 using a Braun MSK homogenisation (B. Braun and Diessel Biotech GmbH, Melsungen, FRG). Subsequently. 
4 ml of PE buffer was added and the homogenate was mixed and centrifuged 5 minutes, at 4°C, 40,000xg. 
Aliquots of 5 to 25 ug of the supernatant were subjected to SDS-polyacrylamide gel electrophoresis, using 
purified HBsAg (0.05 to 0.5 ug) as a control. The presence of antigenic material was demonstrated by 
transferring the protein onto nitrocellulose according to the procedure of Towbin (1979) and detection with 

75 the monoclonal antibody RF6 as described in Method 5a above. The Western blots allow an estimation of 
the amount of antigen produced in clones 352, 415 and 416, by reference to the known amounts of pure 
yeast-derived HBsAg. 

The results are summarized in Table 1. In methanol induced flask-cultures (SMR medium) with a cell 
density of about 5 g dry weight/I the S-protein constitutes about 2 to 5% of the total extracted protein of 
20 transformed strains. 

The comparison of expression in cells grown under conditions of induction (methanol), derepression 
(glycerol) and repression (glucose) demonstrates a very stringent control of the MOX and FMP promoters. 
Expression is completely blocked in glucose grown cells. In glycerol, synthesis of HBsAg is about 30% of 
the level obtained in induced cells. 

25 

b) Antigenicity of HBsAg expressed in Hansenula polymorpha. 

The antigenicity of the material contained in crude extracts was measured using an Abbott AUSZYME 
30 test based on monoclonal antibodies. The results were expressed as arbitrary units (absorbance at 492 nm 
per 0.1 yg protein). Dilution of crude extracts and the reaction were both performed in 150 mM NaCI, 25 
mM phosphate buffer pH 7.4, 0.01% BSA, 0.01% Tween 20. The reactivity with the AUSZYME test 
indicates the presence of conformational epitopes characteristic for sub-viral particles. 

35 
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TABLE 1 HBsAg production in strains 

expression S-gene 







HBsAg 


HBsAg 






estimated 


estimated 






by Western 


by AUSZYME 




Carbon 


blotting 


assay 


Strain Promoter 


Source 


mg/lOOmg 


arbitrary 






protein 


A4Q0 units 








0 . lpg protein 


352 FMD 


Glucose 


No signal 


0.3 




Glycerol 


0.8-1.2 


10.0 




Methanol 


2.5-3-0 


25.0 




Glucose 


No signal 


0.1 


415 MOX 


Glycerol 


1.0-1.5 


12.0 




Methanol 


4.0-5.0 


45.0 




Glucose 


No signal 


No signal 


416 MOX 


Glycerol 


0.5-0.8 


5.0 




Methanol 


2.0-3.0 


22.0 



c) Density gradient centrifugation of crude extracts of H. polymorpha 

The crude extracts obtained according to the procedure described above were subjected to density 
gradient centrifugation either using CsCI or sucrose gradients. Sucrose equilibrium sedimentation gradient 
centrifugation was performed by applying 100 to 200 ug (200 ul) of the crude protein extract on top of a 20 
to 50% sucrose gradient in 50 mM NaCL 2 mM EDTA, 25 mM NaPO*. pH 7.2. Tubes having a volume of 5 
ml were centrifuged for 18 hours at 40,000 rpm in a Beckman SW 50.1 rotor. 

Cesium chloride gradient centrifugation was performed by diluting 1 volume of the crude protein extract 
with 1 volume of 3 M CsCI in 25 mM phosphate buffer pH 7.4 Centrifugation was performed at 4°C, 40,000 
rpm in a Beckman 50 71 rotor for 40 hours. 

The gradients were subjected to fractionation and the various fractions tested using RF6 antibodies in 
Western blotting or using AUSRIA and/or AUSZYME assays. It could be shown that a peak of HBsAg 
material with a density of about 1.16 to 1.19 g/ml is formed in a CsCI density gradient. The material present 
in this peak reacts very well not only with RF6 antibodies in Western blots but also with the AUSRIA and 
AUSZYME assays. This density as well as reactivity in AUSRIA/AUSZYME tests indicate the presence of 
conformational epitopes characteristic for particles. Moreover, in a further experiment the density of this 
material was compared to sub-viral particles isolated from human serum. It could be shown that both 
materials are found at a comparable density by isopycnic centrifugation in CsCI solutions. 

Sucrose gradient centrifugation of crude extracts gave peak fractions, which reacted with AUSRIA and 
AUSZYME and which constituted at least 80% of the HBsAg as shown by analyses of gradient fractions in 
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parallel with AUSRIA/AUSZYME and Western blotting. The residual 20% of the material is found mainly in 
the top and bottom fractions of the gradient and react only in Western blot analysis, indicating that this 
material constitutes the monomeric form of the protein. Analysis of peak fractions obtained from CsCI 
gradients also revealed that at least 80% of the material forms particles. 

The Hansenula derived particles are resistant to proteolysis. Incubation of crude protein extracts 
containing particles for several hours at various temperatures reveals only a very minor degradation as 
shown by testing the AUSZYME values and analysing the intactness of the polypeptide by denaturating 
SDS-PAGE followed by Western blot. The material was further analysed by electron microscopy. The 
electron microscopy pictures clearly show particles with a diameter of about 22 nm. 

The above described results were obtained for all 3 strains (352. 415 and 416). 



Parte 



Expression of preS1-S2-S protein in Hansenula polymorpha 



Example C.1 



Conduction of Hansenula polymorpha strains containing the preS1-S2-S gene 

Hansenula polymorpha RB 10 was transformed with the 5570 bp Pvul-Asp718 fragment derived from 
pMPS-221 encompassing the URA3 gene and preS1 gene under control of the MOX promoter (see Figure 
8). Furthermore, the corresponding fragment carrying the preS1 gene under control of the FMD promoter ( 
-9 deletion) was obtained from plasmid pMPS-21. The integrants resulting from these transformations were 
shown to contain 1 to 2 expression cassettes by Southern analysis. Integrants containing the preS1 gene 
under control of MOX promoter are referred to as preS-AM; integrants containing the preS gene under 
control of FMD promoter are referred to as preS-AF. 

integrants containing several expression cassettes were obtained by transforming Hansenula polymor- 
pha with autonomously replicating plasmids pMPS22 and pMPS21 bearing the HARS sequence. Integrants 
wire formed by spontaneous integration of these plasmids as described in "Materials and Methods", item 
2b. These types of transformants were given the general designation preS-BM (MOX promoter) and preS- 
BF (FMD-promoter). 

Strains preS-AM-405, preS-BM-402, preS-BM-403 and preS-BM-454 wherein expression of the preS1- 
gene is regulated by the MOX promoter were retained for further analysis. 



2. Mitotic stability 

Mitotic stability of the foreign DNA present in the four strains identified in Example C.1 above was 
tested by Southern analysis of 12 independent subclones isolated before and after a 40 generations growth 
period as described in PART B. The analysis confirmed the genetic stability of the integrated vectors (see 
also an example in PART D.3). 



3. Expression of preS protein in Hansenula polymorpha 

Strains preS-BM-402, preS-BM-403, preS-AM-405 and preS-BM-454 were grown on SMR medium 
containing glycerol followed by induction of the cells with methanol (10 g/l) extractly as described in Part B. 
Cells were harvested 25 hours following addition of methanol and crude extracts were prepared as 
described in Part B. 

a) 12 ixg amounts of crude extracts of strains preS-BM-402, preS-AM-405, and preS-BM-454 were 
analysed by polyacrylamide SDS-gel electrophoresis followed by Western blotting using a mixture of 
monoclonal antibodies (S1.1. S2.2 and S2.5) as described in "Methods 5 (a)". Both types of strains showed 
an antigenic double band at 38/39 kD and a further band at 45 kD. 
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Strain preS-AM-405 showed a lower content of the 45 kD species relative to the 39 kD materials than 
did strain pres-BM-402; strain preS-BM-454 showed the highest relative content of the 45 kD species. 

The electrophoretic pattern of preS-protein from the above strains was compared with antigen derived 
from human serum particles. Both human and yeast preparations show a band at about 38/39 kD when 
s compared on Western blots. 

b) A variation in the expression level for different strains was also seen. These differences were 
presumably due to variations in copy number of the expression cassette between the strains. The preS- 
antigen was estimated by Western blotting to constitute 0.1 to 1.5% of the total cell protein in different 
preS-A and pres-B strains as given in Table 2. 
10 Crude extracts were also tested with the AUSZYME assay for the presence of particles. The results as 
shown in Table 2 demonstrate a very low reactivity expressed as A492 units per 0.1 ug protein. 

TABLE 2 



Expression of preS in H. polymorpha 


Clone 


Estimated PreS protein by 


AUSZYME A«2 units 




Western blotting mg 


per 0.1 ug protein 




preS1/100mg 




preS-AM-405 


0.1-0.2 


0.01 


preS-BM-402 


0.3-0.5 


0.02 


preS-BM-403 


0.4-0.6 


0.02 


preS-BM-454 


1.2-1.5 


0.01-0.2 



25 

c) Glycosyiation: Strain preS-BM-454 characterised by high production of preS protein (about 1 .5% of 
total cell protein) and a pronounced 45 kD band was chosen for further studies on glycosyiation. PreS 
protein from preS-BM-454 was analysed by incubating the crude protein extracts with EndoH en- 

30 doglycosidase. The following data clearly show that the 45 kD species is a glycosylated form of preS 
protein. 

Upon incubation of 45 ug crude protein for 0.5, 1.0 and 24 hours with 0.5 mU EndoH endoglycosidase 
according to the instructions of the manufacturer (Boehringer Mannheim, FRG), the 45 kD band shifted to 
the apparent molecular weight of 42 kD. This shift indicates that the 45 kD species is a N-glycosylated form 

35 of preS protein, bearing one 'core 1 group of about 3000 D. 

Tunicamycin, known to be a potent inhibitor of N-glycosylation, was also used to study the glycosyia- 
tion of preS protein in Hansenula polymorpha . Strain preS-BM-454 was grown to a density of A600 = 10.0 
in YNB medium containing glucose (10 g/l). 10 ml of this culture was inoculated into 100 ml of YNB 
containing 10 g/l methanol and 50 ug/ml tunicamycin. The pH value of the medium was kept constant at 7.2 

40 to 7.5. A control experiment was conducted without use of tunicamycin. The cells were harvested 10, 15 
and 30 hours following inoculation of the medium containing tunicamycin. 

Crude protein extracts were analysed by Western blotting. The studies demonstrated that in the cells 
grown in the presence of tunicamycin a single band at 42 kD and a double band at 38/39 kD are visible. In 
the control experiment the 42 kD band was replaced by a 45 kD band. Similar results were obtained with 

45 extracts from strains preS-BM-402, preS-BM-403 and preS-AM-405, which however produce much lower 
amounts of the 45 kD protein species (not more than 5 to 10% of total pre-S-protein synthesised). The 
results suggest that the 42 kD band represents an O-glycosylated form of preS-protein. 

d) Density gradient centrifugation 

Crude extracts from strain preS-AM-405 and strains preS-BM-402, preS-BM-403, preS-BM-454 were 
so subjected to analysis by sucrose and CsCI gradient centrifugations. The conditions were described in Part 

B 3(c). The gradient fractions were tested by Western blotting and by AUSZYME tests. 

The analysis revealed that in contrast to strains expressing S antigen (PART B) or both preS1 and S 

antigens (PART D), the preS-strains do not efficiently form sub-viral particles. Neither isopycnic CsCI nor 

sucrose gradient centrifugation revealed formation of a band of the required density. This was also 
55 confirmed by the fact that the HBsAg-related material from gradient fractions and crude extracts reacted 

poorly with the AUSRIA or AUSZYME tests. 
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PART D 



Formation of composite particles containing both preS and S antigens 



Example D.1 



Construction of strains 

Several strains were constructed which are characterised by different ratios of preS1 to S expression 
and different total expression levels. 

These differences were achieved by variation of the copy number of the expression cassettes per 
genome. Additional variability was obtained by placing the genes under tht control of different Hansenula 
promoters. 

Three basic types of recombinant strains were obtained as follows: 

a) Strains containing one copy of the preS1 gene and one copy of the S gene (A - strains) 

The 6.6 kb Nhel DNA fragment containing the preS1 cassette and the URA3 gene was isolated from 
plasmid pMPS9 (Figure 9). A Nhel DNA fragment containing the S gene was isolated form plasmid pMPS2 
(Figure 5). The fragments were ligated using T4 DNA ligase and joined fragments consisting of one copy of 
each of the genes were isolated from agarose gels. These fragments were used to transform Hansenula 
polymorpha strain RB10 to uracil independance. 

The resulting transformants were analysed by Southern blotting and clones containing one integrated 
copy of each cassette were selected. Strain preS/S-A-293 was retained for further analysis. 

b) Strains containing one preS expression cassette and several S expression cassettes (B - strains) 

Hansenula polymorpha was transformed with the 5570 bp Asp718-Pvu! fragment from pMPS-22 
containing the preS1 expression cassette. Transformants bearing one integrated copy of the cassette were 
selected. The strain retained was preS-AM-405 described in part C.1 above. 

Strain preS-AM-405 was then transformed with autonomousiy replicating plasmids PRB-5-326 (FMD- 
promoter) or pRB-S-322 (MOX-promoter) (Figure 7) containing the S gene under control of the respective 
promoters. Both plasmids in addition bear a gene encoding resistance against gentamycin G418 as a 
selective marker. The Kan gene is under the control of the Saccharomyces cerevisiae ADH1 promoter 
(Figure 7) as described~above. Transformation yielded several strains containing always one integrated 
copy of the preS1 cassette and multiple integrated copies (30 to 100) of the S gene containing cassette as 
shown by Southern blot and PFGE analyses. 

Strains preS/S-B-431 , preS/S-B-432 and preS/S-B-433 were retained for further analysis. Strain preS/S- 
B-431 carries a MOX - promoter controlled S-gene, whereas preS/S-B-432 and preS/S-B-433 contain 
expression cassettes comprising the FMD promoter. 

c) Strains containing several expression cassettes of both the S and preS genes (C-strains) 

Hansenula polymorpha was transformed with the autonomously replicating plasmid pMPS-22 containing 
the pre-S1 gene (Figure 8). Transformation yielded several strains containing variable numbers of expres- 
sion cassettes integrated into the genome. Isolates containing 2 to 20 preS1 expression cassettes were 
identified by Southern blotting. Those strains were described in Part C as preS-BM-402, preS-BM-403 and 
preS-BM-454 and express different amounts of preS antigen. These preS strains were then again 
transformed with autonomously replicating plasmids pRB-S-322 (MOX - promoter) and pRB-S-326 (FMD 
-promoter), which bear the S-gene (Figure 7). Resistance against G418 was used as a transformation 
marker. Selection for stable integrants yielded several clones for each transformation some of which were 
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further analysed Strains retained for further analysis were preS/S-C-452, preS/S-C-465 derived from strain 
preS-BM-402, strain preS/S-C-466 derived from strain preS-BM-403, strain preS/S-C-448-C4 derived from 
strain preS-BM-454 wherein the S-gene expression cassettes comprise the FMD promoter. Table 3 
provides detailed information about the origin and designation of the above described strains. 
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Example D.2 

5 

Expression of preS and S-proteins in hL polymorpha 

Expression of preS and S-proteins in different transformant strains of HL polymorpha was tested by 

10 immunoblotting as described above in Materials and Methods 5a and by the AUSZYME assay. A summary 
of the results and the properties of the various strains tested is given in Table 3. 

Strains expressing both the preS and S-proteins were grown and induced as described in part B item 3 
except that 30 ml culture volumes were used. Crude cell extracts wer prepared as described above and 
subjected to SDS-polyacrylamide gel electrophoreses followed by Western blotting using RF6 or a mixing 

15 of S1.1 and S2.1 monoclonal antibodies for detection. 12 to 15 ng amounts of crude extracts from methanol 
induced cells of strains preS/S-A-293, preS/S-B-431 , preS/S-B-432, preS/S-B-433, preS/S-C-453, preS/S-C- 
465, preS/S-C-466 and preS/S-C-448-C4 were immunoblotted using monoclonal RF6 for detection. This 
analysis demonstrates that a spectrum of strains can be obtained which express different ratios of preS to 
S-protein. From these and other immunoblots the ratio of preS to S-protein in strain preS/S-B-431 was 

20 estimated to be about 1:15, in strain preS/S-C-452 to be about 1:8 and in strain preS/S-448-C4 to be about 
5:3. The strains were found to differ also in their level of productivity both by immunoblotting and by 
AUSZYME assay as shown in Table 3. Cell extracts from these strains were also immunoblotted using 
monoclonal S1.1 for detection. In most strains the preS protein is detected as the doublet protein band at 
38-39 kD with lesser amounts of the 45 kD band (which is estimated to comprise only about 5% of the total 

25 preS-protein detected). In contrast strain preS/S-C-448-C4 expresses approximately equivalent amounts of 
the 38-39 kD and 45 kD species. 



Example D.3 

30 

Stability of the foreign DNA 

The stability of the foreign DNA in the above described preS/S transformants and integrants was 
35 analysed as described in part B. The studies indicate a very good mitotic stability of the recombinant 
clones. Southern analysis of 12 independent colonies isolated before and following a 40-generation 
fermentation of strains preS/S-453 and preS/S-431 demonstrated that in all cases the same restriction 
fragment pattern was visible. The preS/S strains listed in Table 3 were also analysed by Pulsed Field 
Gradient Electrophoresis. The separated chromosomes were blotted onto nitrocellulose membranes and 
40 then hybridised with different radioactive DNA probes. The procedure allows estimation of the number of 
copies of the S-expression cassette and proved conclusively that these preS/S strains are integrants. The 
results of these experiments are summarised in Table 3 above. 



45 Example D.4 



Isolation and characterisation of composite particles 

50 

a) Partial purification of composite particles from Hansenula polymorpha. 

A fermenter culture of Hansenula polymorpha strain preS/S-B-431 was grown in S-medium described 
above in Materials, 5d and the cells recovered by centrifugation after 78 hours induction with methanol. The 
55 cell pellet was resuspended at a concentration of about 120 g dry cell weight per litre in buffer containing 
0°5% (w/v) Tween 20, 2 mM EDTA, 4 mM PMSF, 5% (v/v) isopropanol and 50 mM sodium phosphate 
buffer pH 9.0. Cells were broken in a bead mill (Dynomili Type KDL, Bachofen AG, Basel, Switzerland) 
containing 0.45-0.7 mm diameter glass beads using 4 passages at a flow rate of 6 liters per hour and a 
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residence time of 7 minutes in the chamber. 

The cell extract was clarified by centrifugation for 45 minutes at 16000 g at 4°C and the supernatant 
recovered. 

1% (w/v) colloidal silica (Aerosil 380, Degussa, Frankfurt, FRG) was then added to the supernatant with 
5 stirring and the mixture was stirred overnight at 4°C to permit adsorption of HBsAg. The silica was 
recovered by centrifugation for 30 minutes at 4000 g at 4°C and washed twice with 0.9% (w/v) NaCI by 
resuspension of the pellet and recentrifugation. HBsAg adsorbed to the silica was desorbed by resuspen- 
sion of the washed pellet in 10 mM sodium pyrophosphate pH 9.5 and incubation for 3 hours at 37°C with 
constant stirring. The volume of sodium pyrophosphate buffer added was about one eighth to one tenth of 
10 the initial clarified cell extract volume. The solution was then centrifuged at 17,000 g for 60 minutes at 4°C 
and the supernatant recovered. 

The supernatant was made 1 .5 M with CsCI and the mixture centrifuged to isopycnic equilibrium for 70 
hours at 45000 rpm in a 50.2 Ti rotor (Spinco Division, Beckman Instruments, Palo Alto, USA). The resulting 
CsCI gradients was fractionated and fractions tested for the presence of HBsAg using an Elisa kit 
T5 (Enzygnost, Behring-Werke AG, Marburg, FRG) as described by the manufacturer. Peak fractions were 
pooled and dialysed against 10 mM sodium/potassium phosphate, 0.15 M NaCI pH 6.8. Assays made on 
this material and the crude cell extract showed that 48%of total AUSTRIA reactive material, less than 3% of 
total proteins, less than 0.2% of total sugars and less than 2% of total lipids were recovered in this 
procedure. 

20 A cell pellet of Hansenula polymorpha strain preS/S-C-452 was also extracted by exactly the same 

procedure with similar results. 

Samples of the partially purified HBsAg preparations from the preS/S-B-431 and preS/S-C-452 strains 

were examined by gel electrophoresis and immunoblotting using monoclones HBS1 and S1.1 for detection 

of HBsAg related proteins as described in Part D.4 below. 
25 Immunoblotting with monoclonal HBS1 showed the presence of a reactive band at 24 kD and a further 

band at about 38/39 kD in both preparation. Immunoblotting with monoclonal S1.1 detected only the preS1- 

protein related band at 38/39 kD in both preparations. 

This shows that both the preSl-protein and S-protein species copurify through colloidal silica adsorption 

and isopycnic CsCI density gradient centrifugation. 

30 

b) CsCI density gradient centrifugation of composite particles. 

A fermenter culture of H. polymorpha strain preS/S-B-431 was grown in S-medium (Materials, 5d) and 
35 harvested after 73 hours induction with methanol. The cell pellet was recovered by centrifugation and 
resuspended and a cell extract prepared by passage through a French press as described below. Cellular 
debris was removed from the crude lysate by centrifugation for 30 minutes at 17000 g. Crude cell extracts 
were then centrifuged to equilibrium at 245,000 g in 1.5 M CsCI, phosphate buffered Saline pH 7.5. 
Gradients were fractionated and each fraction analysed for antigenicity using ELISA tests specific for HBsAg 
40 (Enzygnost : Behringwerke AG, Marburg, FRG) and for preS1 -protein. The Enzygnost tests measures only 
assembled HBsAg particles, not protein monomers. The presl-protein specific Elisa test employs murine 
monoclonal antibody S1 .1 for antigen capture and detection and is described below in Part D.5. 

A peak of HBsAg activity was observed at a density of 1.18-1.19 g/cm 3 in the CsCI gradient. A major 
peak of preS1 -protein activity was observed at this same density together with a minor peak at a density of 
45 1.26 g/cm 3 . 

The gradient fractions were also analysed for the presence of preS1 -protein S-protein by immunoblot- 
ting. After sodium dodecylsufate polyacrylamide gel electrophoresis according to Laemmli (1970) and 
blotting to nitrocellulose according to Towbin et al. (1979), the nitrocellulose sheet was incubated with a 
rabbit polyclonal serum raised against serum derived HBsAg. Detection of antibody binding was by the 
50 alkaline phosphatase method. 

A major protein band at 24 kD corresponding to the S protein and a double band at 38/39 kD were 
observed. The maximum activity in the immunoblotting for all three protein species was in CsCI gradient 
fractions with a density of 1.18 - 1.19 g/cm 3 which corresponds to the peak of HBsAg activity measured in 
the Enzygnost test. This indicates that the majority of HBV surface antigen proteins in the crude extract are 
55 present in a lipoproteic structure with a density characteristic of HBsAg. 

A further nitrocellulose sheet was incubated with murine monoclonal antibody S1.1. This antibody 
detected only the protein bands at 38/39 kD in fractions corresponding to a CsCI density of 1.18-1.19 g/cm 3 . 
The crude cell extracts were also subjected to rate-zonal centrifugation on a sucrose gradient. Crude cell 
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extracts were loaded on 5-20% (w/v) sucrose gradients made up in 50 mM Tris-HCI buffer pH 8.1 and 
centrifuged at 288,000 g for 200 minutes. Gradients were fractionated and the fractions analysed for the 
presence of HBsAg by the Enzygnost Elisa test and for the presence of preS1 -protein containing HBsAg by 
capture of antigen with the HBsAg specific antibodies from the Enzygnost test and detection with preS1- 
s protein specific Mab S1.1. The Enzynost assay showed a peak of HBsAg activity sedimenting through the 
gradient. 

The Mab.1.1 Elisa showed the same peak as the Enzygnost assay but with a shoulder of activity 
extending into higher sucrose densities. This shoulder of material with a higher sedimentation coefficient is 
eliminated when crude extracts from Hansenula polymorpha strain preS/S-B-431 were partially purified by 
w adsorption/desorption on colloidal silica and centrifugation in a CsCi gradient before analysis by sucrose 
density gradient centrifugation. These results show that both HBsAg and preS1 -protein cosediment in CsCI 
density gradients with a density characteristic of lipoproteic particles. 



15 c) Antigenicity of composite HBsAg particles expressed in H. polymorpha 

HBV surface antigen particles were partially purified from a crude extract of H. polymorpha strain 
preS/S-B-431 as described above in item 3a. This preparation was tested for reaction with murine 
monoclonal antibodies (Mabs) directed against epitopes located on the preS1 , preS2 and S regions of the 
20 surface antigen protein and for the presence of the receptor for polymers of human serum albumin (pHSA- 
receptor) on the preS2 region. 



C1. Reaction with MabS1.1 

25 

A murine monoclonal antibody (Mab) S1 .1 directed against a preS1 epitope of the HBV surface antigen 
protein was used to detect the presence of this site on the composite particle by an ELISA assay. MabS1 .1 
was coated on the wells of a microtitre plate (Immunoplate I; Nunc, Gibco Europe, Ghent, Belgium) and 
incubated with the partially purified crude extract. After washing to remove non-bound material, capture of 

30 reactive particles was revealed by incubation with MabSf.1 coupled to horse radish peroxidase by the 
method of Wilson and Nakame, 1978. 

Colour development to reveal second antibody binding was with orthophenylenediamine as chromogen. 
Absorbance was read at 490 nm (with 620 nm for the reference filter) in an Intermed Immunoreader, NJ 
2000 (Analis, Ghent, Belgium). 

35 The partially purified extract from H. polymorpha strain preS/S-B-431 gave a positive reaction in the test 
indicating capture by and binding MabS1.1 which is specific for the preS1 region. Surface antigen particles 
containing only S-protein purified from S. cerevisiae RIT4376 (Harford et al 1987) gave no reaction in this 
test 

40 

C2. Reaction with MabS2,5 

Murine monoclonal antibody S2.5 is directed against the preS2 region of the HBV surface antigen. This 
Mab was used in an ELISA test to determine the presence of the preS2 epitope on the composite particles. 

45 MabS2.5 was coated on wells of a microtitre plate and incubated with the partially purified cell extract from 
H. polymorpha preS/S-B-431 . Antigen binding was revealed by incubation with MabS2.5 coupled with 
horseradish peroxidase and colour development as described above. The partially purified cell extract from 
H. polymorpha preS/S-B-431 gave a positive reaction in the test indicating the presence of the preS2 region 
epitope on the surface antigen particles. Surface antigen particles containing only S-protein purified from S. 

so cerevisiae RIT4376 gave no reaction in the test. ~~ 



C3. Reaction with Mabs RF1 and S1.1 (HRP) 

55 Murine monoclonal antibody RF1 (H. Thomas, Royal Free Hospital, London, U.K.) is directed against a 
conformational dependant epitope located in the S region and was used in an ELISA test. This Mab was 
used to coat wells of microtitre plates and incubated with the partially purified extract of H. polymorpha 
strain preS/S-B-431 . Particle binding was revealed by incubation with the horseradish peroxidase coupled 
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S1.1 Mab described in section C.1 above. The partially purified extract from H. polymorpha gave a positive 
result in the test whereas surface antigen particles purified from S. cerevisiae*RIT4376 gave no reaction. 



s C4. Assay for the pHSA receptor 

An assay for the detection of the pHSA receptor present on the preS2 region was made according to 
the procedure of Pontisso et al. (1983). 

Polymerised human serum albumin was coated on the wells of a microtitre plate and incubated with the 
io partially purified cell extract of K polymorpha strain preS/S-B-431 . After washing to remove non-bound 
material, particles captured on the plate were detected by incubation with polyclonal anti-HBsAg rabbit 
antibodies labelled with I 125 (AUSAB kit). The partially purified extract from H. polymorpha gave a positive 
reaction in the test whereas HBsAg purified from S. cerevisiae RIT4376 gave no reaction in the test. 

The above results show that the HBsAg reactive material present in the partially purified extract from H. 
15 polymorpha contains antigenic sites specifically bound by Mabs directed against epitopes in the preSl, 
preS2 and S regions of HBV surface antigen and that this material also contains a site reacting with PHSA. 



d) Immune precipitation of composite particles. 

20 

To determine whether the AUSRIA reactive material present in a partially purified extract of H. 
polymorpha strain PreS/S-B-431 contained composite particles an immune precipitation with a monoclonal 
antibody specific for the preSl protein was performed. 

Immune complexes were captured by formalin-fixed Staphylococcus aureus (Staph A) cells 

25 (Immunoprecipitin, BRL, Gaithesburg, MD, USA) using rabbit anti-mouse serum as sandwich antibody 
between the monoclonal antibody and the Staph A cells. Staph A cells were pre-treated as recommended 
by the supplier and finally washed and resuspended at 10% (w/v) in PBS. The Staph A cells (120 ui of 10% 
w/v suspension) were then incubated with 20 ul of rabbit anti-mouse serum (RAM) for 2 hours at room 
temperature. The Staph A-RAM complex was collected by centrifugation, washed twice with PBS containing 

30 0.1% (v/v) Tween 20, and finally resuspended at about 10% (w/v) in PBS containing 0.1% (v/v) Tween 20. 

To a partially purified preparation of particles of H. polymorpha strain preS/S-B-431 obtained as 
described above and to a sample of yeast-derived S-prbtein HBsAg particles (lot HB193, Smith Kline 
Biologicals, Rixensart, Belgium) both containing 2 jxg AUSRIA reactive material in 500 ul was added 1 ul of 
MabS1.1 (638 ug protein per ml) and the mixtures were incubated for 2 hours at room temperature. 20nl of 

35 the Staph A-RAM complex was then added and incubation continued overnight at 4°C. Immune complexes 
formed were collected by centrifugation, washed 5 times with PBS containing 0.1% (v/v) Tween 20 and 
once with PBS before resuspending the final pellets in sample buffer for SDS polyacrylamide gel 
electrophoresis. The immune precipitates were analysed by immunoblotting as described below using 
mouse monoclonal antibody RF6 (H. Thomas, Royal Free Hospital, London) for detection of HBsAg protein 

40 species. Monoclonal antibody RF6 is directed against a denaturation and reduction resistant epitope of 
plasma derived HBsAg and competes in binding experiments with monoclonal HBSI. 

The immune blot showed that the immune precipitate obtained from the HL polymorpha preS/S-B-431 
particles contained not only the preSl -protein but also the S-protein while the immune precipitate from the 
Saccharomyces derived HBsAg contained no S-protein. 

45 This confirms that composite particles of mixed polypeptide composition are present in the material 
partially purified from H. polymorpha strain preS/S-B-431 cell extracts. 

e) Electron microscopy 

50 

Partially purified HBsAg particles from H. polymorpha strain preS/S-B-431 were examined by electron 
microscopy. Material was diluted in 20 mM Tris-HCI buffer pH 8.2 with 0.1% (w/v) bovine serum albumin 
and sedimented onto copper/rhodium grids covered with a film of celloidin before staining with uranyl 
acetate. Examination in the electron microscope showed the presence of spheroidal to spherical particles 
55 with a mean diameter of 27 nm. This size is within the range of mean particle diameters measured by the 
same technique for a series of HBsAg preparations purified from S. cerevisiae strain RIT4376 [Petre, J. et 
al., Postgrad. Medical Suppl. 2 , 73-81 (1987)]. 
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Example D.5 



5. Comparison of composite particles expressed in H. polymorpha and S. cerevisiae 

Cultures of strain preS/S-C-453 of H. polymorpha and S. cerevisiae strain Y1108 were grown in the S- 
medium as described in Materiais, 5d. Strain Y1108 is a diploid strain of S. cerevisiae expressing both the 
preS1 and S-proteins from expression cassettes integrated into the genome by Ty vectors. The strain 
contains 5-7 copies of a preS1 expression cassette and 3-4 copies of a S- expression cassette. The strain 

10 requires tryptophan for growth. Construction and chromosomal integration of such expression cassettes 
using Ty vectors is described in copending USSN 07/292 202 filed December 30, 1988 assigned to 
SmithKiine Beckman Coporation which is hereby incorporated as reference. Glycerol at 1.5% (w/v) was 
used as carbon source for growth of H. polymorpha . A series of 2 litre erlenmeyer flasks containing 400 ml 
of medium were inoculated with H. polymorpha strain preS/S-C-453 and incubated on an orbital shaker for 

is about 33 hours at 30* C until exhaustion of glycerol. The cells from three flasks were harvested by 
centrifugation and the cell pellet stored at -70°C until further processing. 100 ml of culture medium was 
removed from each of six further flasks and replaced with 100 ml of fresh medium containing 4% (w/v) 
methanol. Incubation was then continued. 

Three cultures were harvested by centrifugation after a further 14 hours incubation and a further three 

20 cultures harvested after 38 hours incubation in the presence of methanol. The cells were recovered by 
centrifugation and the cell pellet stored at -70°C until further processing. 

In a second experiment cultures of S. cerevisiae Y1108 were grown as described above and harvested 
after 19.5 hours incubation. Cultures of "R. polymorpha preS/S-C-453 were also grown as described above 
except that cells were harvested after 26 hours growth in glycerol medium and after 45 hours in methanol 

25 containing medium. 

Cell pellets of Hansenula polymorpha were resuspended at a concentration of about 25% wet cell pellet 
weight per volume in buffer containing 0.5% (w/v) Tween 20, 2 mM EDTA, 4 mM PMSF 
(phenylmethyisulphonylfiuoride), 5% (w/v) isopropanol and 50 mM sodium phosphate pH 9.0 10 ml of this 
mixture was passaged 6 times in a French Press at 20,000 psi to break the cells. The crude cell extract was 

30 then clarified by centrifugation at 17,000 g for 45 minutes at 4°C and the supernatant recovered. Cell pellets 
of §: cerevisiae were treated identically except that cells were resuspended at a concentration of about 50% 
wetcell pellet weight per volume of the buffer described above. 

The protein content of each supernatant was determined by the method of Lowry et al. (1951). The 
clarified cell extracts were assayed for HBsAg by AUSRIA and for the presence of a preS1 protein epitope 

35 by an Elisa assay. The monoclonal S1 .1 Elisa assay was performed as follows. Wells of plastic trays (Nunc 
immunopiate 1 , Gibco Europe, Ghent, Belgium) were coated with S1 .1 mouse monoclonal antibody in 50 
mM sodium bicarbonate buffer pH 9.6 for 2 hours at 37°C, and the antibody solution removed. The wells 
were then blocked by incubation with a PBS solution containing 1% (w/v) bovine serum albumin for 1 hour 
at 37°C. Serial two fold dilutions of cell extracts made in PBS, 0.2% (w/v) bovine serum albumin were then 

40 added to wells and incubated 1 hour at 37°C. 

The wells were then washed 3 times with 0.9% (w/v) NaCI, 0.05% (w/v) Tween 20. Antigen capture was 
revealed by incubation with monoclonal antibody S1 .1 coupled to horseradish peroxidase by the method of 
Wilson and Nakane (1978). 

Conjugated antibody in PBS, 0.2% (w/v) bovine serum albumin was added to each well and incubated 

4$ for 1 hour at 37°C. Second antibody binding was revealed by addition of 100 u\ of a solution containing 4 
mg o-phenylenediamine (Sigma), 15 ul hydrogen peroxide dissolved in 10 ml 0.1 M KH2PO4 pH 6.0 and 
incubation for 15 minutes at room temperature. The reaction was blocked by addition of 25 ul 1 N H2SO4, 
and the optical density read at 490 nm (with 620 nm for the reference filter) in an Intermed Immunoreader, 
NJ200 (Analis, Ghent, Belgium). Results were calculated from the linear portion of the optical density curves 

50 obtained and expressed as Optical Density Units per mg protein. 

The amount of AUSRIA reactive HBsAg and presl -protein determined in the cell extracts of S. 
cerevisiae and HL polymorpha from the two experiments are shown in Tables 4 and 5. The amount of 
AUSRIA reactive materials found in crude extracts of S. cerevisiae strain Y1108 is about one hundred-fold 
less than in extracts of H. polymorpha preS/S-C-453 induced for 38 to 45 hours in methanol. The amount of 

55 preS-protein assayed by" Elisa is about 3 to 8-fold less indicating that S. cerevisiae strain Y1108 makes a 
greater proportion of preS1 -protein to S-protein than does H. polymorpha strain preS/S-C-453. 
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TABLE 4 



AUSRIA assay of crude cell extracts 


Cell extract 


Flask 


Protein 


HBsAg (AUSRIA) 


preS1 -protein Elisa; OD 




Number 


(mg/ml) 


% total protein 


units per mg protein 


n 

S.cerevisiae 


a 


1 ft 0 


n ni 1 


2 9 


Y1108 


b 


18.0 


0.008 


3.0 




c 


12.0 


0.010 


4.4 


H.polymorpha 


a 


6.8 


0.077 


2.4 


preS/S-C-453 


b 


7.4 


0.123 


2.3 


glycerol 


c 


9.2 


0.143 


1.7 


culture 










Methanol 


a 


5.5 


1.15 


9.2 


induced 38 


d 


7.0 


1.40 


11.3 


hours 


c 


7.4 


1.46 


8.8 



TABLE 5 



AUSRIA assay of crude cell extracts 


Cell extract 


Flask 


Protein 


HBsAg (AUSRIA) 


preS1 -protein Elisa; OD 


protein 


Number 


(mg/ml) 


% total protein 


units per mg protein 


S.cerevisiae 


a 


13.3 


0.009 


8.3 


Y1108 


b 


15.0 


0.015 


9.7 




c 


17.1 


0.013 


8.9 


H.polymorpha 


a 


8.4 


0.070 


1.3 


preS/S-C-453 


b 


8.2 


0.061 


0.3 


glycerol 


c 


7.9 


0.082 


0.3 


culture 










Methanol 


a 


6.3 


1.106 


4.1 


induced 45 


b 


6.5 


1.314 


6.8 


hours 


c 


7.7 


1.688 


5.1 



The results were confirmed by immunoblotting of the extracts from the first experiment together with 
extracts of methanol induced cultures of strains 454 and 448-C4 as controls. 

Samples containing 15 ug of protein were electrophoresed through 12.5% separating, 5% stacking gels 
45 according to the method of Laemmli (1971) and subjected to immunoblotting as described above in 
Materials and Methods 5a using monoclonal HBS1 for detection. 

Extracts from strain preS/S-C-453 show a strongly reactive band at 24 kD related to the S-protein and a 
band at 38-39 kD related to preS-protein. A 45 kD band is barely detectable. In contrast extracts from S. 
cerevisiae strain Y1108 show weak bands at 24 kD and 45 kD. The 38-39 kD doublet band of preS-protein 
so was barely detectable. The immunoblotting result confirms that S. cerevisiae expresses less S and preS- 
proteins than methanol induced H. polymorpha and that the preS protein expressed by S. cerevisiae is 
predominantly the glycosylated 45"kD form. 

55 Example D.6 



Myristoylation of preS protein in H. polymorpha . 
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Cultures of strain LR9, strain 452 and 453 were grown in shake flasks on minimal medium with 2.5% v/v 
glycerol for 24 hours and methanol (1%, v/v) was then added. 6 hours after methanol addition, tritium 
labelled myristic acid was added and the cultures incubated for a further 16 hours. Cells were collected, 
washed and disrupted with glass beads in the presence of 1% SDS and 0-mercaptoethanol. The extracted 
5 proteins were separated by SDS-PAGE gel electrophoresis and visualised by immunoblotting with mon- 
oclonal HBS1 and by fluorography. The results showed that a band corresponding to the 38/39 Kd preS1 
protein was labelled in the extracts of stains 452 and 453 but not 1n the extract from LR9. This is consistent 
with post translations removal of the N-terminal methionine and myristoylation of the glycine residue at the 
second position of the polypeptide (Towler and Gordon, Ann. Rev. Biochem., 57 :69-99, 1988). 



Example D.7 



75 Immunogenicity of preS antigens from H. polymorpha 

Crude protein extracts were prepared from methanol grown cultures of strains 452 and 454 and 
adsorbed onto aluminium hydroxide at 1 mg/ml final concentration. The preS1 epitope content of each 
extract was measured in an ELISA assay using monoclonal antibody S1.1 and preS1 protein partially 

20 purified from S. cerivisiae as standard. Immune complexes for injection were also formed by incubating 600 
mcl of S1.1 monoclonal antibody with 1 ml of sepharose Protein G 4 Fast Row (obtained from Pharmacia 
LKB,. Brussels, Belgium) for 2 hours at 4°C. The mixture was then centrifuged, washed with PBS and the 
pellet resuspended in 1 ml PBS. 250 mcl of this suspension was added to 3.4 ml of each crude cell extract 
and the mixture incubated for 1 hour at 20°C. 

25 Following incubation the Sepharose Protein G/S1.1 /crude protein mixture was centrifuged and the pellet 
resuspended in 10 mM sodium phosphate buffer. Half of this material was adsorbed onto aluminium 
hydroxide at 1 mg/ml final concentration. 

Groups of 5 Balb/c mice were injected intraperitoneally with the four preparations. One month later the 
mice received an intraperitoneal booster injection of 1 meg of aluminium hydroxide adsorbed, partially 

30 purified particles from a methanol grown culture of strain 453. The mice were bled 15 days later. 

The titres of anti-S, anti-preS2 and anti-preS1 antibodies in the sera were determined in the same way as 
described hereinbelow. 

Table 6 shows the respective antibody titres induced in Balb/c mice expressed as Geometric Mean 
Titer. 

35 The results show that anti-preS antibodies are induced in mice when injected with preS1-S2-S 
containing extract (strain 454) or with extract containing composite particles (strain 452) and boosted with 
composite particles (strain 453). 



Table 6 

40 



Preparation 


meg of antigen 


a-HBS GMT 


a-120-145 


a12-32 


a32-47 




injected (a)(b) 


in mlU/m 


GMT (c) 


GMT (c) 


GMT (c) 


Strain 454 












crude extract 


222 


17755 


688 


240 


612 


Immune ppt 


155 


616 


518 


59 


1400 


Strain 462 












crude extract 


48 


124803 


426 


59 


<100 


Immune ppt 


48 


119903 


487 


64 


784 


Notes 













(a) Relative to preS1-S2-S antigen from S. cerevisiae 

(b) All mice received a booster injection at day 30 of particles partially purified from strain 



(c) Titres are expressed as the inverted dilution giving an OD of 1.0 in the assay. 
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Part E 

5 



Example E.1 

70 

Expression of composite particles containing both modified preS1-S2-S (L*) and S antigens of single or 
mixed subtype. 

ts (1) Construction of a L* (ad) coding sequence 

A modified preS1-S2-S (L*) coding sequence suitable for direct insertion on the Hansenula expression 
vector pMPT-121 (Figure 11) was constructed as follows. Plasmid DNA of pRIT13192 described herein- 
below was digested with Hindlll endonuclease to obtain linear fragments. About 1 ng of this DNA was mixed 

20 with oligonucleotides BC74 and BC75 and subjected to polymerase chain reaction (PCR) amplification. 

PCR technology is well known and is disclosed in 'PCR Protocols, A guide to methods and applica- 
tions 1 ; Innes M.A. et al. (Eds.) Academic Press Inc., San Diego CA. 1989. 

Oligonucleotides BC74 and BC75 have the sequences shown below and were synthesised by conven- 
tional phosphoramidite chemistry. t 

25 BC74 5' CCC AG A TCT AAG CTT ATT AAA TGT ATA CCC A 3 
BC75 5' CCC GAA TTC AAA ATG GGG ACG AAT CTT TCT 3 

Oligonucleotide BC74 has homology to the 3 end of the L* coding sequence on pRIT13192 together 
with a Hindlll and Bglll retriction site extension. Oligonucleotide BC75 has homology to the 5 end of the L* 
coding sequence on pRIT13192 together with an EcoRI site and a short leader sequence before the ATG 

30 codon as extension. About 1 ng of template pRIT13192 DNA was mixed with 50 mM KC1, 10 mM Tris-HC1 
PH8.3, 1.5 mM MgC1 2 , 0.01% (w/v) gelatin, 200 uM each of dATP, dCTP, dGTP and dTTP, luM of each 
oligonucleotide primer and 2.5 units of Tag polymerase in a final volume of 100 ul using commercially 
available DNA amplification reagents (Gene Amp DNA Amplification Reagent kit from Perkin Elmer Cetus; 
obtainable from Vander Heyden, Brussels). The mixture was subjected to 25 cycles of denaturation (2 min, 

35 92°), annealing (2 min, 48°) and extension (2 min, 72°) using a DNA Thermal Cycler (Perkin Elmer Cetus; 
obtained from Vander Heyden, Brussels). One ng of the modified, amplified L* coding sequence fragment 
so obtained was then reamplified in a second round with the same oligonucleotides in the same conditions 
and purified from the reaction mixture by ethanol precipitation. 

An aliquot of about 250 ng of the 870 bp adapted L* fragment DNA was digested with EcoRI and Bglll 

40 endonucleases and cloned between the EcoRI and Bglll sites of a pBR327 derivative plasmid, pRIT12555 to 
give pRIT13488. pRIT12555 consists of the pBR327 replicon with a Bglll site introduced by linker insertion 
between the EcoRI and Clal sites of the original plasmid. 

A further aliquot of the adapted L" fragment was digested with EcoRI and Bglll endonucleases and 
inserted between the EcoRI and Bglll sites of the plasmid pMPT121 (Figure 11). This places the L* coding 

45 sequence between the MOX promoter and the MOX terminator on an expression plasmid suitable for 
introduction into H. polymorpha by transformation. DNA sequencing of the resulting plasmid, pLM1, 
showed that the l/coding region sequence was correct. 



so (2) Construction of an S-(ay) coding sequence 

pRIT13490 is a pBR322 derivative plasmid containing a complete preS2-S coding sequence obtained 
by conventional recombinant DNA manipulations from pRIT10601 which contains the complete genome of a 
hepatitis B virus of ay subtype cloned on pBR322. pRIT10801 was deposited under the Budapest Treaty 
55 with the American Type Culture Collection, Rockville MD on June 2, 1982 under Accession Number ATCC 
39132. 

It will be appreciated by one of skill in the art that in the following manipulations described below 
PRIT13490 could be substituted by pRIT10601 or any other cloned HBV fragment containing a complete S- 



49 



EP 0 414 374 A2 



(ay) coding sequence. DNA of PRIT13490 was digested with Hindlll endonuclease to linearize the plasmid 
and mixed with the oligonucleotide BC74 described above and with oligonucleotide BC73 which has 
homology to the 5' end of the S coding sequence on pRIT13490 together with an EcoRI site and a short 
leader sequence before the ATG codon as extension. 

5 BC73 5' CCC GAA TTC AAA ATG GAG AAC ATC ACA TCA 3' 

The mixture of Hindlll treated pRIT13490 and oligonucleotides was amplified by 2 rounds of PCR as 
described above to generate a 700 bp S-(ay) coding sequence fragment flanked by EcoRI and Bglll 
restriction sites. About 250 ng of this material was digested with EcoRI and Bglll endonucleases and cloned 
between the EcoRI and Bglll sites of pRIT12555 to give pRIT13438. The correctness of the S-(ay) coding 

io sequence on pRIT13438 was verified by DNA sequencing. 



(3) Construction of a L* (ad/ay) coding sequence 

75 About 1 ng of the PCR amplified fragment used to construct pRIT13488 was further amplified by 2 
rounds of PCR amplification in the presence of oligonucleotides BC75 (described above) and BC30 having 
the following sequence. 

BC30 5 ACG AGT CTA GAC TCT GCG GTA 3' 

BC30 is homologous to the region around the Xbal site of the S coding sequence derived from the 
20 pRIT10616 HB virus clone of ad subtype. A 280 bp fragment was recovered from the amplification and 
about 250 ng of this was digested with EcoRI and Xbal endonucleases. 

About 1 ng of the PCR amplified fragment used to construct pRIT13438 was further amplified by 2 
rounds of PCR amplification in the presence of oligonucleotides BC74 (described above) and BC12 having 
the following sequence. 
25 BC1 2 5' ATG GAG AAC ATC ACA TCA 3' 

BC12 is homologous to the first 6 codons of the S coding sequences of the HB virions cloned on 
pRIT10601 and pRIT1Q616. Following PCR amplification about 250 ng of the 700 bp fragment was digested 
with Xbal and Bglll endonucleases. 

The two endonuclease digested, PCR amplified fragments were then ligated with EcoRI, Bglll digested 
30 DNA of pRIT12555. 

A plasmid, pRIT13491, was recovered consisting of pRIT12555 together with the EcoRI-Xbal fragment 
from the PCR amplification of the fragment used to construct pRIT13488 and the Xbal-Bglll fragment from 
the PCR amplification of the frament used to construct pRIT13438. DNA sequencing of the L* coding insert 
on pRIT13491 showed that an error had been introduced at the 5th amino acid of the S coding region. To 

35 correct this, the PCR amplified fragment used to construct pRIT13488 above was digested with EcoRI and 
Xbal endonucleases and used to replace the corresponding EcoRI-Xbal fragment on pRIT13491 which 
contained the error. The resulting plasmid pRIT13489 specifies a "hybrid" L" coding sequence in which the 
part from the ATG initiation codon to the Xbal site is derived from ad subtype virus sequence and the part 
from the Xbal site to the termination codon is derived from ay subtype virus sequence. The corresponding 

40 polypeptide will have the ay subtype. 

The EcoRI-Bglll fragment from pRIT13489 containing the hybrid L* (ad-ay) coding sequence was then 
inserted between the EcoRI and Bglll sites on plasmid pMPT-121 to give the H. polymorpha expression 
vector pL* 13. 

45 

(4) Expression in H. polymorpha of both modified preS1-S2-S (L*) and S antigens to form composite 
particles. 

Several strains were constructed which were characterised by different ratios of L* and S expression as 
so well as different total expression levels. These differences were achieved by variation of the copy number of 
the expression cassette per genome. 

H. polymorpha strain RB10 was transformed as described above with the plasmid pL* 11 bearing the L* 
(ad^expression cassette. Transformation yielded several strains containing variable numbers of expression 
cassettes integrated into the genome. Two of these strains, designated L13/1 and L19/1, were analysed in 
55 more detail. Southern blot analysis revealed that these strains contain several copies of the L* (ad)- 
expression cassette. The L" (ad)-expression level of these strains was assayed by Western blot analysis 
using monoclonal antibody HBS1. A band of 33 kD corresponding to L* antigen was detected. Expression 
levels were estimated to be about 0.5-2% of total cell protein when cells were grown in glycerol medium 
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followed by addition of methanol. 

The L* (ad)-expressing strains were transformed again with plasmid pRB-S-322 bearing the S(ad)-gene 
under control of the MOX-promoter. PRB-S-322 is described above (see Figure 7). Resistance against 
gentamycin was used as a transformation marker. Stable integrants were obtained employing the methods 

s described above. Two of the strains obtained were studied in more detail: strain LMS9/1 and strain 
LM526/2, both derived from recipient strain L13/1. Southern blot analysis revealed that these strains carry 
several copies of the L*(ad) expression cassette as well as several copies of the S(ad) expression cassette. 

The expression of L* and S antigens in the H. polymorpha transformants was tested by Western blotting 
using monoclonal antibodies HBS1 and S1.1 as~described above. A spectrum of strains was obtained which 

w express different ratios of L* to S antigen. HBS1 antibodies detect a 24 kD protein corresponding to S 
antigen, and 30 and 33 kD bands corresponding to L* antigen. S1.1 antibodies specific for the preS1 region 
reveal a 33 kD band as well as the 30 kD band. The LVS ratios were estimated to be about 1:1 for strain 
LMS9/1 and about 1:10 for strain LMS26/2. Expression levels were estimated to be about 5-6% of the total 
cell protein when cells were grown under induced conditions with methanol. 

is Glycosylate studies showed that the 33 kD protein represents a glycosylated form of the L" antigen. 
When crude protein extracts from strains LMS9/1 and LMS26/2 were incubated with endoglycosidase H, the 
33 kD band shifted to the apparent molecular weight of 30 kD. 30 kD is the expected molecular weight of L* 
antigen. Thus part of the L* antigen produced in H . polymorpha is "core "-glycosylated while the remainder 
constitutes the unglycosylated form of L*. 

20 In order to test for particle formation crude cell extract from strain LMS9/1 was subjected to CsC1 
density gradient centrifugation. Gradient fractions were analysed by immunoblotting using antibody HBS1. A 
peak of HBsAg-reactive material including the 24 kD S antigen and the 30 and 33 kD L* antigen forms was 
observed at a density of 1.18 g/cm 3 . This result indicates that composite particles were formed in H. 
polymorpha strains coexpressing the L* and S polypeptides. 

25 In a further variation of the invention a H. polymorpha strain coexpressing the L* and S proteins was 
obtained by further transforming strain 415~described above. Strain 415 carries between 50-100 S(ad) 
expression cassettes under the control of the MOX promoter and is characterised by a high S antigen 
expression level. 

A L* (ad) expression vector containing a kanamycin-resistance gene was constructed. A 3.5 kb Nrul- 

30 fragment encoding the kanamycin-resistance gene under control of the ADHI promoter and the HARS 
sequence was isolated from plasmid pHK2 (Figure 6). Plasmid pL*H was digested with Nrul and sail 
endonucleases and made blunt ended by fill-in with Klenow DNA polymerase. The Nru I/Sal l-digest 
eliminated the HARS sequence from the pL*11-vector. The Nrul-fragment from pHK2 was then ligated to the 
blunt ended large Nrul-Sall fragment of the pL*11 -vector thus introducing the kanamycin-resistance gene 

36 and re-introducing the HARS sequence which had been eliminated. The resulting plasmid is designated 
pKL*1 or pKL*10 depending on the orientation of the kanamycin resistance gene cassette. The structure of 
pKL*1 is shown in Figure 12. These plasmids can be introduced into H. polymorpha by transformation and 
selection for gentamycin resistance as described above in Methods. 

Strain 415 was transformed with pKL*1 and pKL*1 0 carrying the L*(ad) expression cassette and stable 

40 transformants were obtained after selection for resistance to gentamycin. Several strains were further 
analysed. Western blot analysis using antibody HBS1 showed a 24 kD protein corresponding to S antigen 
as well as the 30 and 33 kD bands corresponding to the unglycosylated and glycosylated forms of L* 
antigen respectively. In these 415 derived strains the LVS ratio varied from about 1:1 in strain LMS05-14-1 
to about 1:20 in strain LMS05-12-1. In strains grown under fully induced conditions with methanol, LVS 

45 antigen expression levels amounted to an estimated 5-6% of the total cell protein. No differences between 
strains transformed with pKL*1 and strains transformed with pKL*1 0 were observed. 



(5) Co-expression of L" and S antigens leading to composite particles with mixed subtype. 

50 

To obtain H. polymorpha strains which produce L7S-particles with a mixed (ad/ay) subtype, plasmid 
pLM3 bearing a~L* (ad/ay) expression cassette was transformed into H. polymorpha RB10. 

Stable clones expressing the L* antigen can be obtained and retransformed with plasmid pRB-S-322 
carrying the S(ad) expression cassette with selection for gentamycin resistance. Stable strains from the 
55 second transformation are analysed by Western blotting and AUSRIA assays to determine the respective 
expression levels of the L* (ad/ay) and S(ad) antigens and the total level of surface antigen expression. 

Strain 415 expressing S(ad) protein may also be retransformed with a derivative of pL*13 which carries 
a functional gentamycin resistance marker to generate stable transformants expressing both the L* (ad/ay) 
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and S(ad) polypeptides in the form of composite particles with both ad and ay subtype specificities. 



Example F.1 

5 

Construction of plasmid pRIT12979 coding for a non-myristylated HBV large protein 

The outline of the construction of plasmid pRIT12979 is shown in Figs. 13 and 14. Fig. 13 shows the 
10 steps for constructing an E. coli plasmid harbouring a A Gly13 L protein-expression cassette, pRIT12998. 
Fig. 14 illustrates the procedure~for obtaining a yeast plasmid pRIT12979 capable of expressing a A Gly13 
L protein. 

It is understood that the Gly 13 residue is encoded by the second codon of the L protein coding 
sequence represented in Table A. 

15 The starting material for this construction is plasmid pRIT12863, a derivative of pBR327 which contains 
the TDH3 promoter region and the ARG3 transcription termination region [Cabezon et al., Proc. Natl. Acad. 
Sci. USA , 81 :6594-6598 (1984)] separated by a BamHI, Smal, EcoRI linker. A linker containing a Bglll site 
iFplaced upstream of the TDH3 promoter sequences. pBR327 is described in Soberon et al, Gene , 9 :287- 
305 (1980) and was obtained from F. Bolivar (Department of Molecular Biology, National University of 

20 Mexico). This plasmid may be recovered as described in co-pending U.S. application SN 009,325. 

The expression cassette for the non-myristylated L protein is constructed within the linker region of 
pRIT12863. 

The plasmid pRIT12816, which is an E coli plasmid derived from pASl [Rosenberg et al, Methods 
Enzymol 1CM :1 23-1 64 (1983)], harbours~the~~coding region for the large HBV (adw serotype) protein 
25 flanked by arTNcol site (overlapping the ATG at codon position 12 in the L protein coding sequence) and by 
an EcoRI site beyond the stop codon. This plasmid is constructed by conventional recombinant DNA 
techniques [see, T. Maniatis et al, cited above]. 

Plasmid pRIT12816 is digested with BStXI and EcoRI and the 304bp BStXI-EcoRI fragment encoding 
the N-terminal portion of the preS1 region is recovered and ligated to the following synthetic BamHI-BstXI 
30 adaptor: 

BamHI Xbal BstXI 
GATCCCTCTAG ACG AAT CTT TCT GTT CCC AAC C 
35 GGAGATC TGG TTA TAA AGA CAA GGG TT. 

This fragment is inserted into pRIT12863, previously digested with BamHI and EcoRI enzymes, 
resulting in plasmid pRIT12996. The BamHI-BstXI synthetic adaptor harbours the N-terminal preS1 region, 
40 but does not provide the correct reading frame with the BamHI site placed at the ATG residue of the TDH3 
promoter. The correct reading frame is provided by digesting pRiT12996 with BamHI and Xbal, followed by 
treatment with mung bean enzyme to eliminate the protruding extensions and ligation. This procedure 
produced plasmid pRIT12997. 

Finally, the 839bp EcoRI fragment from pRIT12816 was inserted into the EcoRI site of pRIT12997 in the 
45 correct orientation to recover the complete coding sequence for the A Gly L protein. The resulting plasmid 
pRIT12998 contains the expression cassette coding for an L protein which is deleted for Gly13 and cannot 
serve as substrate for the myristoyl transferase. 

To construct a yeast plasmid capable of expressing the A Gly L protein, plasmid pRIT12998 was 
digested with Bglll and Sail. The 5822 bp Bglil-Sall fragment that contains the expression cassette was 
5Q purified and inserted into a conventional yeast vector pRIT12741 which is a pBR327 derivative that supplies 
an AmpR gene, the 2u sequences, the E. coli replicon and the LEU2 yeast marker for selection. 

The resulting plasmid pR!T12979 was "used to transform the LEU- S. cerevisiae strain 10S44 c cir° (pep4- 
3, ieu2-3, Ieu2-112), otherwise called TCY1 or Y482, which is deposited in the American Type Culture 
Collection, Rockville, Maryland, U.S.A. under accession number ATCC 20818. After transformation, the 
S5 resulting LEU + yeast strain is called Y720. 



Example F.2 
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Comparative Characterisation of A Gly L Protein with L Protein 

Yeast strain Y720 of Example F.1 was examined for A Gly L protein expression in comparison with 
yeast strain Y587 (TCY1 transformed by pRIT12845) which expresses the L protein. pRIT12845 contains a 
5 3370 bp expression cassette consisting of a HBV DNA derived coding sequence for the 389 amino acids of 
the L protein flanked 5 by, and under the control of, a 1050 bp TDH3 promoter fragment and 3' by a 1150 
bp fragment carrying the ARG3 transcription terminator. pRIT12845 is described in detail in Example 16A of 
co-pending U.S. Patent Application SN 009,325. Yeast strain Y1017, which is TCY1 transformed with 
pRIT12741, served as a negative control. 



A. Myristylation 

Cells from Y720, Y587 and Y1017 were labelled with 3 H-myristic acid as described in co-pending U.S. 
Patent Application SN 009,325 . SDS-extracts of these cells were analysed by SDS polyacrylamide gel 
electrophoresis (SDS-PAGE), followed by both immunoblot and fluorography [See, e.g., Towler and Glaser. 
Proc. Natl. Acad. Sci ., U.S.A ., 83 :281 2-281 6 (1 986)]. 

The immunoblot as described in co-pending U.S. Patent Application SN 009,325 involved detection with 
an S-specific monoclonal antibody (Mab), either Mab RF6 [H. Thomas, Royai Free Hospital, London] or 
Mab HBs1 [smithKline Biologicals]. These antibodies recognize overlapping denaturation and reduction 
resistant epitopes in the S protein. The immunoblot showed the expression of two forms of L protein of 
38kd and 45kd in both Y720 and Y587. 

The fluorograph showed the presence of 3 H label in the 38 kd L protein derived from Y587 cells as 
described in co-pending U.S. application SN 009,325. Minor amounts of label were observed in the 45kd L 
protein derived therefrom. In contrast, neither the 38kd nor the 45 kd A Gly L protein derived from Y720 
cells had incorporated the label. This demonstrates that the elimination of the glycine codon in pRlTl2979 
abolishes the myristylation of the A Gly L protein in S. cerevisiae . 



30 B. Expression level 

Cells from Y720, Y587 and Y1017 were grown under identical conditions in YNB without amino acids 
[Difco Labs] 0.675% containing 2% glucose in Erlenmeyer flasks. Both total disrupted cells and crude 
extracts were analyzed by quantitive immunoblot as described in co-pending U.S. application SN 009.325. 

35 Two-fold serial dilutions of the samples were analyzed and the immunodetection was based on the S- 
specific Mabs RF6 or HBs1, or a preS1 -specific monoclonal antibody, S1.1 [SmithKline Biologicals]. The 
immunoblots displayed a 2 to 5-foid higher expression level for the A Gly L protein (Y720) than the 
myristyiated L protein in Y587. Extraction efficiency appeared to be about equal for the non-myristylated 
and myristyiated proteins. The proportion of L protein that was found in the 45 kd glycosylated form was 

40 not changed by the elimination of the myristylation. 



C. Lipoprotein structures 

45 Crude extracts from Y720 and Y587 cells were subjected to CsCI equilibrium centrifugation. Gradient 
fractions were analyzed by immunoblot, AUSRIA assay [Abbott Labs], an ELISA assay specific for a preS1 
epitope using MAb S1.1, in which MAb S1.1 was coated on the wells of a microtitre plate [Immunoplate I; 
Gibco Europe] and incubated with the test samples. After washing to remove non-bound material, capture of 
reactive particles was revealed by incubation with Mab S1.1 coupled to horseradish peroxidase by the 

so method of Wilson and Nakane (1978) in 'Immunofluorescence and Related Techniques 1 , W. Knapp et al., 
eds., Elsevier, Amsterdam (1978). Colour development to reveal second antibody binding was with 
orthophenylenediamine as a chromogen. Absorbance was read at 490 nm with 620 nm for the reference 
filter, in a intermed Immunoreader, NJ2000 [Analis, Ghent, Belgium]. 

The antigenic and immunoblot profile were identical. Both the A Gly L and the wild type L protein 

55 banded at a density of about 1.25 g/cm 3 , demonstrating that both the myristyiated and non-myristylated 
proteins are present as lipoprotein structures in crude cell extracts. Sucrose gradient velocity centrifugation 
of CsCI equilibrium gradient purified material showed identical sedimentation behaviour for the myristyiated 
and non-myristylated L protein containing lipoprotein structures, indicating that these lipoprotein structures 
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have comparable physical parameters. 



Example F.3 

5 

Construction of plasmids coding for a modified HBV L protein devoid of potential 0- and N- glycosylation 
sites, potential pHSA binding sites, and potential protease sensitive sites" ~ 

w An outline of the construction of two yeast vectors: pRIT13192, coding for the expression of a 
myristylated modified L protein (L*) and pRIT13193, coding for the expression of a non-myristylated 
modified L protein (A Gly L*) is presented in Figs. 15 and 16. 

pRIT10616, deposited as ATCC 39131, is digested with Bgll!, and the largest fragment ligated to the 
PACYC177 replicon [Chang and Cohen, J . Bacteriol , 134 :1141 (1978)], previously digested with BamHI. 
75 The intermediate plasmid obtained was digested by "EcoRI and reiigated to obtain pRIT10633. which 
harbours an incomplete HBV genome from which the pACYC184 vector has been excised. 

Plasmid pRIT12331 is a pUC9 [Amersham, U.K. and Pharmacea, Sweden] derivative containing the 
1500 bp Hindlll-Ncol ARG3 promoter fragment from pRIT10779 [Cabezon et al, Proc. Natl. Acad. Sci. U.S.A 
81 :6594-6598 (1984)] fused in the correct orientation to a 681 bp Ncol-EcoRI fragment which contains the 
20 S coding sequences with the Nco! site overlapping the ATG codon and the EcoRI site beyond and adjacent 
to the stop codon, inserted between the Hindll! and EcoRI restriction site of pUC9. 

PRIT10633 is digested with BStEII and Xbal enzymes. The 648 bp BstEil-Xbal fragment, in which the 
BstEII extension was filled in by Klenow polymerase, containing the entire PreS1-PreS2 region and the N- 
terminai sequences of the S region was purified and inserted between the Smal and Xbal sites of pUC12 
25 [Amersham]. The resulting plasmid is pRIT13188. 

Plasmid pRIT13188 was treated with Ball and BamHI, which deleted the preS1 region coding for the 
amino acids extending from residues 52 to 133 [the codons for amino acid 53 and 132 were included in 
deletion]. The large fragment was treated with Klenow polymerase and ligated, resulting in plasmid 
pRIT13189 in which the BamHI site is reformed. 
30 The DNA corresponding to the preS amino acid region extending from residue 145 to 175 [the codons 
for amino acids 146-174] were deleted by digestion of pRIT13189 with BamHI and Xbal. This vector was 
ligated to the following synthetic adaptor: 

BamHI extension Ncol extension 

35 

GATCCCAGAGTCAGGGGTCTGTATTTTCCTGCTGGTGG 

GGTCTCAGTCCCCAGACATAAAAGGACGACCACCGTAC 

40 AspProArgValArgGlyLeuTyrPheProAlaGlyGly 

135 145 



and to the Ncol-Xbal fragment from pRIT12331 containing the N-terminal portion of the S gene. The 
resulting plasmid is pRIT13190. 

The C-terminal region of the S gene, the termination of transcription ARG3 region and the LEU2 yeast 
gene were added by insertion of the 4145 bp Xbal-Sall fragment from pRIT12660 between the Xbal and 
Sail sites of pRIT13190. [pRIT12660 contains a 3050 bp expression cassette consisting of a HBV DNA 
derived coding sequence for the 281 amino acids of the M protein flanked 5' by, and under the control of, a 
1050 bp TDH promoter fragment and 3' by a 1150 bp fragment carrying the ARG3 transcription terminator. 
PRIT 12660 is described in detail in copending U.S. patent application SN 009,325.] The resulting plasmid 
pRIT13191 was obtained. 

To reconstitute the expression cassettes for the L* and A Gly L* protein the TDH3 promoter associated 
either to the N-terminal coding region for aL protein or to the N-terminal coding region for a A Gly L protein 
was inserted into pRIT13191. 

The 1234 base pair EcoRI-Ball fragment was purified from pRIT12845, containing the L protein coding 
sequence described above and in U.S. SN 009,325, and the 1227 bp Bglll-Ball fragment was purified from 
pRIT12979 containing the A Gly L protein coding sequence as described above. 
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Each fragment was mixed with the large Sacl-BamHI fragment from pRIT1319l followed by treatment 
with T4 polymerase and ligation. The resulting plasmids were designated pRIT13220 and pRIT13222. The 
coding sequence contained in the expression cassette present in pRITl3220 contains the amino acid 
residues of 12 to 52 from the preS1 sequence, 133 to 145 from the preS2 sequence and the entire S 
s sequence as indicated in Table A. pRIT13222 is identical to pRIT13220 with the exception of the deletion of 
the Gly13 codon (GGG). 

The last step was the preparation from pRIT13220 and pRIT13222 of the Clal-Clal fragment containing 
each expression cassette for the L* and A Giy L* proteins respectively and ligation of them to the Clal-Clal 
fragment from plasmid pRIT12845, giving rise to plasmids pRIT13192 and pRIT13193. respectively. 
w These latter plasmids were used to transform the S. cerevisiae strain TCY1 , to give LEU strains Y1 1 39 
and Y1140, respectively. 



Example F.4 

15 

Expression of L* and A Gly L* in yeast 

Crude yeast extracts from Y587 (L protein), Y1139 and Y1140 (L* and A GlyL* respectively] of Example 
20 F.3 and Y724 (control S. cerevisiae strain, no expression cassette inserted) were subjected to SDS-PAGE 

and immunoblot detection with either the S-specific Mab HBs1 or Mab S1.1. The Y1139 and Y1 140 extracts 

showed two bands of estimated molecular weight of about 33 and 30 kd which were specifically recognized 

by both the preS1 and S specific antibodies. 

Treatment of the extracts with endoglycosidase H [New England Nuclear or Boehringer] or 
25 glycopepsidase F [Boehringer] resulted in the disappearance of the 33 kd band and an increase of the 30 

kd band. This indicates that the 33kd band represents a glycosylated form of the L* and A Gly L'protein of 

Example F.3 carrying one N- linked high mannose type glycan. 

Labelling of cell cultures with 3 H myristic acid followed by analysis of cell extracts by SDS-PAGE, 

protein blotting and either fluorography or immuno detection, showed a strong presence of 3 H-label in the 
30 30 kd band and a weak incorporation of 3 H-label in 33 kd derived from the Y1139 extract The 3 H present in 

these bands was resistant to hydroxylamine treatment. No incorporation of 3 H-label was observed in the 

corresponding protein bands from the Y1140 extract. This shows that the L* protein as expressed in Y1139 

cells serves as a substrate for the yeast N-myristyl transferase and that deletion of the Gly 13 residue in the 

A Gly L" protein of Y1140 prevents the myristoylation of the protein. 
35 Analysis of extracts from Y1189 and Y1140 by CsCI equilibrium and sucrose velocity gradient 

centrifugation indicated the presence of lipoprotein particles in the extracts. 

Example F.5 

40 

Yeast Strains Y1088 and Y1295 Containing Integrated Copies of an S Protein Expression Cassette 

Yeast strains harbouring several copies of an S expression cassette integrated in the genome were 
45 constructed. 

A. Yeast Strain Y1 088 

so The Ty based linear vector pRIT13133-L described in co-pending U.S. patent application SN 368,401 
containing an expression cassette for the S protein (TDH3 promoter - S protein coding sequence - ARG3 
termination region) was used to transform both mating types of S. cerevisiae strain 10569d ( ura 3, leu 2, 
trp1 , gal 1, cir°) described in co-pending U.S. patent application SN 368,401. Different haploid transfor- 
mants carrying several vector copies integrated in their genomes were crossed with the haploid strains of 

55 opposite mating type. Southern blot analyses of genomic DNA isolated from segregants using an S DNA 
fragment probe, show 2/2 segregations of vector fragments in tetrad analyses. One haploid segregant 
obtained by this means, which contains 5-7 copies of vector pRIT13133-L, was named Y957. A TRP* 
revertant of Y957 was obtained and named Y1088. 
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B. Yeast Strain Y1295 

The Ty based linear vector pRIT13034-L described in co-pending U.S. patent application SN 368,401 
carries an expression cassette for the S protein identical to that in pRIT13133-L pRIT13034-L carries the 

5 CUP1 gene as an additional marker that can be used in CUP1 8 or cupl A recipient strains. 

Two cupl A S. cerevisiae haploid strains (CUP1 gene disrupted in a strain containing a single copy of 
that gene) are preferred as recipient yeast strains for this Ty linear vector. These strains have the 
respective genomes: EJ cupl A 3d (ura 3, leu 2, trp 1, gal 1 A , cup 1 A , a) and EJ cup 1A7b (ura 3, trp 1, 
gal 1A f cup 1 A, a) and are described in co-pending U.S. patent application SN 368,401. 

w The Ty based linear vector pRIT13034-L containing an expression cassette for the S protein was used 
to transform S. cerevisiae strains EJ cupl A 3d as well as EJ cupl A 7b. URA3 transformants were 
isolated, and "screened for resistance to copper toxicity. The more resistant transformants appeared to 
contain 2 to 5 copies of integrated vectors. Classical genetic crosses between these strains and selection of 
a LEU" TRP" haploid segregant resulted in a haploid strain containing 4 to 5 copies of the S expression 

75 cassette. A TRP* revertant of this strain was named Y1295. 



Example F.6 

20 

Construction of yeast strains that co-express the S and L antigens 



The yeast plasmids pRIT12845 (L protein), pRIT12979 (AGly L protein, Example R1), pRIT13192 (L* 
protein, Example F.3), pRIT13193 (AGly L* protein, Example F.3) and pRIT12914 (no hepatitis gene 
25 inserted) were each used to transform the yeast strain Y1088 to LEU*. The resulting yeast strains obtained 
were designated Y1301 (S, L), Y1302 (S, AGly L), Y1142 (S, L"), Y1261 (S, AGly L*) and Y1141 (control 
strain expressing S protein only) respectively. 

Likewise, pRIT12845, pRIT12979, pRIT13192, pRIT13193 and pRIT12377 (no hepatitis genes inserted) 
were introduced into the yeast strain Y1295. The resulting yeast strains obtained were designated Y1304 (S, 
30 L), Y1305 (S t A Gly L), Y1306 (S, L*), Y1307 (S, A Gly L*) and Y1308 (control strain expressing S protein 
only) respectively. 

Co-expression of the S and L proteins was demonstrated by immunobiot analysis of crude cell extracts. 
Reaction of the blot with Mab HBs1 specific for an S epitope revealed a band of estimated molecular weight 
of about 23 kd migrating at the position expected for the S protein in each of the extracts of the above 
35 mentioned strains. 

Extracts from Y1301, Y1302, Y1304 and Y1305 showed in addition to the 23 kd band, bands of 38 kd 
and 45 kd migrating like the non-glycosylated and glycosylated forms of the L proteins previously described 
in U. S. application SN 009,325, and in Example F.2 herein. Treatment of the extracts with endoglycosidase 
H resulted in the conversion of the 45 kd band into a 41 kd band while the 23 kd and 38 kd bands were not 
40 affected. 

Extracts from Y1142, Y1261, Y1306 and Y1307 showed in addition to the 23 kd band, bands of 30 kd 
and 33 kd migrating like the non-glycosylated and glycosylated forms of the L* and A Gly L* proteins 
described in Example F.4. Treatment of the extracts with endoglycosidase H resulted in the conversion of 
the 33 kd band into a 30 kd band which is identical to the results described in Example F.4. 

45 Labelling of cells with 3 H myristic acid and analysis of extracts by SDS-PAGE followed by immunobiot 
and fiuorography demonstrated the presence of hydroxylamine treatment resistant 3 H-label in the 38 kd 
(extracts from Y1301 and Y1304) and 30 kd bands (extracts from Y1142 and Y1306) with only minor 
quantities of 3 H-label in the 45 kd (extracts from Y1301 and Y1304) and 33 kd (extracts from Y1142 and 
Y1306) protein bands. The corresponding bands in extracts from Y1302, Y1305, Y1261 and Y1307 

so contained no detectable 3 H-label. 

The 38 kd and 45 kd forms of the L and Gly L proteins are recognized in immunoblots by the preS1- 
specific Mab's S1.1 and MA18/7 [W.H. Gerlich, University of Gottingen, FRG], the preS2-specific Mab's 
S2.4, S2.5, S2.7, S2.8, S2.9 and S2.10 [Smith Kline Biologicals]. The 30 kd and 33 kd forms of the L* and A 
Gly L* proteins are recognized by Mab S.1.1 (recognizing an epitope present in amino acid sequence 12- 

55 32), MA18/7 (recognizing an epitope present in amino acid sequence 28-47) and S2.5 (recognizing an 
epitope present in amino acid sequence 133-145). Monoclonal MA 18/7 is reported to inhibit binding of 
virions to liver cell membranes [Pontisso P. et al., Virology, 173 :522-530 (1989)]. The 30 kd and 33kd 
forms of the L* and A Gly L" proteins are not recognized by Mab's S2.4, S2.8, S2.9 which do not react with 
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peptide 120-145 in an ELISA assay nor by Mab's S2.7 and S2.10 which presumably recognize an epitope 
localised in amino acids 120-137. Monoclonal F35-25 (obtained from M-A Petit, INSERM unite 131, Clamart, 
France) binds to particles purfied from strain Y1307 as shown by an ELISA assay but does not bind to S 
particles. This monoclonal recognises a peptide sequence (Petit M-A et al., Molec. Immunol., 26 :531-537, 

s 1989) shown to be involved in binding of virus to HepG2 hepatoma cells (Neurath et al., Cell 46:429-436, 
1986). Furthermore It has been found that (S, L*) particles bind to HepG2 hepatoma cells and that this 
binding, and the binding of HB virus particles can be prevented by Monoclonal F35.15 (Petit M-A et al. The 
1990 intl. Symposium on Viral Hepatitis and Liver Disease, Abstract 105. Houston, U.S.A., April 4-8, 1990). 
The Mab Q19/10 [W.H. Gerlich, University of Gottingen, FRG] which recognizes a glycosylate dependent 

70 preS2 epitope [Heermann et al. in "Viral Hepatitis and Liver Disease", Eds. A. Zuckerman and Alan R. Liss, 
Inc. New York pp. 697-700, (1988)] and reacts with the 45 kd form of the L and Gly L proteins does not 
react with the 30 or 33 kd forms of L* or A Gly L* proteins. These results are in accordance with the amino 
acid deletions introduced in the L* andAGIy L* proteins. 

75 

Example F.7 

Co-expression of S and L or modified L proteins leads to formation of particles of mixed subunit 
20 composition 

Extracts made from each of the strains described in Example F.6 contain HBsAg related antigenicity as 
demonstrated by positive results in the AUSRIA assay performed according to the manufacturer's instruc- 
tions [Abbott Lab.]. Extracts from Y1301, Y1302, Y1142, Y1261, Y1304, Y1305, Y1306 and Y1307 but not 

25 those from Y1141 and Y1308 give a positive reaction in the EL1SA-S1.1 assay. When subjected to the CsCI 
equilibrium centrifugation the AUSRIA and ELISA-S1.1 positive material cobanded around a density of 1.2 
g/cm 3 . Immunoblots confirmed the antigenicity profiles showing that the protein species found in crude 
extracts cobanded in the AUSRIA and ELISA-S1 .1 positive peak. Sucrose velocity gradient centrifugation of 
dialyzed peak fractions derived from the CsCI equilibrium gradients showed co-sedimentation of the S and 

30 L proteins or modified L proteins as determined by AUSRIA, ELISA-S1.1 and immunoblot This dem- 
onstrates that S and L or modified L proteins are assembled into lipoprotein particles. 

Immunoprecipitation reactions with Mab S1.1 were used to demonstrate that the S and L or modified L 
proteins were associated into one physical entity i.e. particles of mixed subunit composition. Extracts were 
made from yeast strains expressing the S and L or modified L proteins either alone or together, for 

35 instance, from Y1139 (L* protein, Example F.3), Y1141 (S protein, Example F.6) and Y1142 (S and L* 
protein, Example F.6). After semi-purification of the particles by CsCI banding and dialysis of the peak 
fractions, the particles were subjected to the immunoprecipitation reaction with Mab S.1.1. A mixture of 
particles from Y1139 and Y1141 was used as an additional control. The L* protein was immunoprecipitated 
in each case. Coprecipitation of the S protein with the L* protein was observed only from the reaction with 

40 particles derived from Y1 142. 

Identical results were obtained with the other yeast strains that co-express the S and L or modified L 
proteins (Y1301, Y1302, Y1261, Y1304, Y1405, Y1306, Y1307). 



45 Example F.8 



Stability and pHSA binding of particles containing the modified L protein 

so The sensitivity to proteolytic degradation by yeast proteases of the L* and A Gly L* proteins was 
analysed by incubating crude cell extracts of Y1142 and Y1161 for either 4 hours at 37°C or 65 hours at 
4°C. 

Extracts of Y1301 and Y1302 served as comparative source of L and A Gly L protein. Analysis of 
extracts by immunoblot before and after incubation showed an almost complete disappearance of the L and 
55 A Gly L proteins after incubation while the L* and A Gly L" proteins were still readily detectable. 

Purified preparations of particles from strains Y1306 and Y1307 were incubated for 7 days at 37°C and 
examined by SDS-PAGE and silver staining to reveal the polypeptides. No detectable degradation of the L* 
or S polypeptides could be observed. 
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Particles derived from Y1142 by CsCI banding were tested by a modification of the pHSA binding assay 
described by Pontisso et al. J. Virol. Methods 6 , 151-159 (1983). Microplates (Immunoplate I; Nunc, Gibco 
Europe) were coated with glutaraldehyde polymerized human serum albumin (5 ug/ml pHSA in PBS, 2 hrs 
37°C) and non-specific protein binding sites on the plates were saturated by incubation (1 hr 37°C) with 
5 PBS containing 1 % bovine serum albumin (BSA). Serial dilutions of the test samples in PBS containing 0.2 
% BSA were allowed (1 hr 37°C) to interact with the pHSA coated plates and bound particles were detected 
(1 hr 37°C) with a non-limiting concentration of horseradish peroxidase coupled anti-S monoclonal antibody 
(RF1, H.C. Thomas, Royal Free Hospital, London) in PBS containing 0.2 % BSA and revealed with H2O2 
and orthophenylenediamine as substrate. Between incubations the plates were washed with 0.15 M NaCI 
70 containing 0.05 % Tween 20. 

Particles derived from Y1141 served as negative control and purified particles from strain 10S44C cir° 
containing pRIT12660 as positive control. pHSA binding was greatly diminished in the Y1142 derived 
particles (2 % residual activity) compared to the particles derived from 10S44C cir° harbouring pRIT12660. 

pHSA binding was also not detectable with purified particle preparations from strains Y1306 and Y1307. 

75 

Example F.9 

20 Immunogenicity of particles containing L* protein 

The immunogenicity of (S, L*) particles of this invention was studied in Balb/c mice. Particles derived 
from either Y1142 (S, U) or Y1141 (S) were partially purified by CsCI banding, and adsorbed on AI(OH) 3 . 
Groups of eight mice were injected with either 50 or 5 ug total protein (corresponding to 1 or 0.1 ug 

25 antigen by AUSRIA assay) of each antigen preparation at days 0 and 30. The bleeding of mice was done at 
day 45 and the different assays performed on individual sera. Anti-HBs antibodies were measured with the 
AusAb kit (Abbott, USA) and expressed in mlU/ml using the Hollinger formula [Hollinger et al., in "Viral 
Hepatitis" Szumness et al. Franklin Institute press, Philadelphia pp. 451-466 (1982)]. 

Anti-preS antibodies were measured using a direct ELISA assay in which selected synthetic peptides 

30 (peptide 12-32, peptide 32-47 and peptide 120-145) were adsorbed to polystyrene walls of microplate 
(Immunoplate I; Nunc, Gibco Europe). Adsorption took place overnight at 4°C from a solution of 0.5 ug/ml 
peptide in 0.05 M NazCCVNaHCOa buffer pH9.6, 100 ul per well. Non-specific sites on the plates were 
blocked by incubation with 250 til, 5 % foetal bovine serum in PBS during 1 hour at 37°C. 

Two fold serial dilutions of the mice sera in PBS containing 1 % foetal bovine serum, 0.05 % Tween 20 

35 were allowed to react with the peptide-coated plate during 2 hrs at 37°C (100 ul per well). After washing, 
bound antibodies were detected by a biotinylated anti-mouse Ig from sheep (Amersham, 500 fold dilution in 
PBS containing 1 % foetal bovine serum, 0.05 % Tween 20, 100 ul per well, incubation for 1 hour at 37°C) 
and then with a streptavidin - biotinylated horseradish peroxidase complex (Amersham, 1000 fold dilution in 
PBS containing 1 % foetal bovine serum 0.05 % Tween 20) during 1 hour at 37°C, 100 ul per well. 

40 Between incubations, the plates were washed with 0.15 M NaCI, 0.05 % Tween 20. Rnal revelation was 
by H20z and ortho-phenylenediamine as chromogen (15 ul H2O and 4 mg ortho-phenyienediamine in 10 ml 
0.1 M potassium phosphate buffer, pH6.0, 100 ul per well). The reaction was stopped after 20 min by the 
addition of 25 ul 1 N H2SO4 and absorbance at 490 nm measured in an Intermed Immunoreader, NJ 2000, 
Analis. The titer for each serum was calculated as the reciprocal of the highest dilution giving an optical 

45 density of 1.00. 

Table 7 shows the respective antibody titers induced in Balb/c mice expressed as Geometric Mean 
Titer. 
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Table 7 



5 



Particles 

UOI ivcu 

from 


Dose 


a-HBs GMT 

in mill/ml 


a1 20-1 45 
GMT* 


a12-32 
GMT* 


«32-47 
GMT 


Y1142 


50 


304000 


- 629 


5222 


2255 


(S.L-) 


5 


234000 


212 


1490 


785 


Y1141 


50 


128000 


202 


211 


<200 


(S) 


5 


29000 


<200 


203 


<200 



0 The titers were expressed as the inverted dilution for an OD of 1 .0. 



Particles purified from strain Y1307 (S, A gly L*) or from strain RIT4376 [Petre J. et al., Postgrad. Med. 
J. 63 , (Suppl. 2), 73-81 (1987)] were adsorbed onto A1(OH) 3 and injected into groups of 10 Balb/c mice on 
days 0 and 30. Mice were bled on day 45 and the pooled immune sera examined for the presence of anti-S 
and anti-preS antibodies as described above. Table 8 shows the respective antibody titers induced in Balb/c 
mice. 

Table 8 



25 



Particles purified 


Dose 


a-HBs 


a 13-32* 


«32-47* 


from 


meg 


mlU/ml 






Y1307 (S.Agly L*) 


1 


7335 


3128 


658 


RIT4376 (S) 


1 


5232 


<100 


<100 



(*) Titres are expressed as the inverted dilution giving an OD of 1 .0 in the 
assay. 
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40 Integration of an L* expression cassette in the S. cerevisiae genome by Tyl mediated homologous 
recombination 

It is desirable to integrate both the L* and S protein expression cassettes of this invention into the host 
genome to provide genetically stable strains and to avoid the 20-30% of plasmidless cells generally seen 
45 with 2 micron based S. cerevisiae vectors. 

Plasmids pRIT13T59 and pRIT13460 were constructed as described for the vector pRIT13009-p in co- 
pending U.S. patent application SN 009,325 and in Jacobs et al. (Gene 80 :279-291, 1989). The LEU2 gene, 
isolated as a 2200 bp Xhol-Sall fragment from the vector pCV9 (Petes et al., Cell 19 :765, 1980). was 
inserted between the Sail sites of the Tyl element on pRIT12927, replacing the original fragment, to give 
so pRIT13144. A 3050 bp Hindlll-Kpnl fragment carrying the L* expression cassette from pRIT13220 described 
in Example F.3 was treated with T4 polymerase and ligated into the T4 polymerase treated Sail site 
remaining on pRIT13144 and using E coli strain XL1-Blue as transformation recipient. XL1-Blue is 
described by Hanahan D. in J.Mol. Biol., 166 :557-580 (1983) and is commercially available from 
Stratagene, 11099 North Torrey Pines Road. La Jolla, California. Both orientations of insertion of the Hindlll- 
55 Kpnl cassette fragment were obtained; pRIT13459 contains the cassette ARG3 terminator region proximal to 
the LEU2 gene, pRIT13460 contains the cassette TDH3 promoter proximal to the LEU2 gene. 

The large Xhol fragments of about 7700 bp from both pRIT13459 and pRIT13460 were purified from 
1% agarose gels and used to transform strain EJ cupl A 3d (see Example F.5 above) to leucine 
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independance by the method of Hinnen et al. (Proc. Natl. Acad. Sci. USA, 75 :1929, 1978). 

LEU* transformants were obtained with both fragments and 12 colonies from each series were analysed 
by Western blotting for L* protein expression and by Southern blotting for the number of integrated 
cassettes. From the transformation of EJ cupl A 3d with the fragment from pRIT13459 a strain Y1528 was 
5 retained which showed a level of L* expression equivalent to Y1306 and contained 3-5 copies of the 
cassette. From the transformation of EJ cupl A 3d with the fragment from pRIT13460 a strain Y1529 was 
retained which has similar properties to Y1528. 

Both Y1528 and Y1529 were mated with strains Y1295 and its trp~ parent strain Y1215 (see Example 
F.5 above) and with strain Y1367 to form diploids as follows : Y1528 x Y1295 = Y1586, Y1528 x Y1215 = 
10 Y1587, Y1528 x Y1367 = Y1588, Y1529 x Y1295 = Y1589, Y1529 x Y1215 = Y1590, Y1529 x Y1367 = 
Y1591. Strain Y1367 is a, Ieu2, trp1, contains 6-8 copies of the same S-protein cassette as strain Y1295 and 
was obtained using the strains and methods described in the construction of Y1295 in Example F.5. 

The diploid strains listed above are screened for their levels of surface antigen expression and the ratio 
of L* and S polypeptides in the particles by Western blotting, AUSRIA assay and epitope specific ELISA 
is tests using the reagents and methods described above, more particularly in Examples F.2 and F.6. The 
number and stability of the integrated expression cassettes is determined by Southern blotting. If desired 
the diploid strains may be sporulated to obtain haploid segregants which may then be screened for 
expression and genetic stability as described above. 

insertion of the 3050 bp Hindlll-Kpnl fragment from pRIT13222 on pRIT13144 and following of the 
20 above described procedures may be carried out to give S. cerevisiae strains with integrated copies of the A 
gly L* expression cassette and strains expressing S ,A gly~L* particles. 

In a further embodiment the L" expression cassette is inserted on a vector similar to pRIT13134-P 
[Jacobs et al., Gene 80 ; 279-291 (1989)] and which carries the URA3 and CUP1 genes as selective 
markers inserted withirilhe Tyl element of pRIT12927. Such a vector is identified as pRIT13501 from which 
25 a Bglil fragment of about 6500 bp with the above elements can be recovered for transformation and 
integration into an appropriate yeast host by selection for URA* transformants. Following transformation the 
number of integrated cassettes may be advantageously increased by selection for increased levels of 
copper resistance as disclosed in copending U.S. application SN 07/368,401. 

30 

Example F.1 1 



Expression of S, L* mixed particles with ad and ay subtype determinants 

35 

An integrative vector with an expression cassette for an L* protein of ay subtype was constructed by 
exchanging the Xbal-Sacli fragment encoding the C-terminal part of an ad subtype S protein and part of the 
ARG transcription terminator with a corresponding fragment from an S-gene of ay subtype. Plasmid 
pRIT13459 (Example F.10) was digested with Xbal and Sacll endonucleases and the largest fragment was 

40 purified from a 1% agarose gel and ligated to a purified 1000 bp Xbal-Sacll fragment from pRIT10780 to 
give pRIT13500. pRIT10780 is disclosed in De Wilde et al., Develop. Biol. Standard, 59 :99-107, (1985). 

The L* coding sequence on pR(T13500 consists of DNA with sequences identicaTto those from a virus 
of ad specificity from the ATG codon to the Xbal site and of DNA derived from a virus of ayw specificity 
from the Xbal site to the termination codon. The expressed L* polypeptide will have ay subtype deter- 

45 minants. 

The Xhol fragment of about 7700 bp from pRIT13500 can be integrated into the yeast genome and the 
resulting transformants mated with strains such as Y1295 and Y1367 by the methods described above so 
as to obtain strains expressing mixed particles in which the L* polypeptide is of ay subtype and the S 
polypeptide of ad subtype. 

50 The above description and examples are illustrative and not limiting of the invention. For example, a 
variety of additional modifications to the L protein may be made, e.g., partial deglycosylation or a 
combination of various of the modifications made herein, in addition to those mentioned in the examples to 
generate modified L proteins and particles of mixed or homogeneous subunit composition containing these 
modified L proteins of appropriate composition for vaccine use. 

55 Additional modifications that may be made include reintroduction of preS sequences, introduction of 
coding sequences derived from HBV protein coding sequences other than from the envelope gene, e.g. 
sequences from the HBV core gene, or sequences coding for immunological important amino acid regions 
from other pathogens. Additional known vector components may be employed to replace the specific 
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marker genes, promoters and linker sequences and the like employed in the examples. Other types of host 
cells may be employed. The host cells, vectors and vector components employed in these examples are 
illustrative and may be replaced with other strains or modifications by one of skill in the art. This invention 
encompasses all improvements and modifications falling within the scope of the following claims. 
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5 

Claims 

1. A modified hepatitis B virus large surface protein comprising an amino acid sequence encoding the L 
protein characterized by a modification in at least one of the following sequences: a sequence sensitive to 

io protease digestion, a sequence necessary for myristylation, a sequence necessary for N-linked glycosyla- 
te, a sequence necessary for O-linked glycosylation, and a sequence necessary for binding of human 
serum albumin. 

2. A modified protein according to claim 1 comprising a primary translated amino acid sequence lacking a 
glycine residue in the second position thereof. 

is 3. A modified protein according to claim 1 wherein any sequence of Asn-X-Thr or Asn-X-Ser t X being any 
amino acid, is altered by deletion or replacement of Asn, deletion or replacement of Thr or Ser, or deletion 
of X. 

4. A modified protein according to claim 1 or claim 2 comprising an amino acid sequence comprising 
residues 12-52, or residue 12 followed by residues 14-52, followed by residues 133-145 followed by 

20 residues 175-400 of the L protein. 

5. A particle comprising a multimeric assembly of a modified L protein of any one of claims 1 to 4. 

6. A composite particle containing at least two polypeptides corresponding to all or part of a protein having 
the biological activity of one of the hepatitis B surface antigens, wherein the particle presents at least two 
antigenic determinants provided by the S-protein f preS2-protein and/or preS1 -protein, said particle option- 

25 ally further comprising host specific lipids. 

7. A particle according to claim 6 comprising a multimeric assembly of the S protein of HBsAg and a 
modified L protein of any one of claims 1 to 4, 

8. A composite particle of the formula (L\ S) wherein L* denotes a modified L protein of hepatitis B virus 
having an amino acid sequence comprising residues 12-52 followed by residues 133-145 followed by 

30 residues 175-400 of the L protein and S denotes the S-protein of hepatitis B surface antigen, said particle 
being in the form in which it is expressed in S . cerevisiae or H . polymorpha . 

9. A particle as claimed in Claim 7 or Claim~8 in which the S protein is of ad subtype and the modified L 
protein is of ay subtype or vice versa . 

10. A vaccine comprising an immunoprotective amount of a modified L protein of any one of claims 1 to 4 
35 or of a particle of any one of claims 5 to 9. 

11. A recombinant DNA molecule comprising a DNA coding sequence for expression of a modified L 
protein of any one of claims 1 to 4 operatively linked to a regulatory sequence. 

12. A recombinant vector comprising the DNA molecule of claim 11. 

13. A host cell transformed with a recombinant vector of claim 12. 

40 14. A cell according to claim 13 selected from the group of yeasts consisting of Saccharomyces , 
Hansenula , Pichia , Candida , Kluyveromyces and Schizosaccharomyces . 

15. A cell according to claim 14 comprising S . cerevisiae or Hansenula polymorpha . 

16. A cell as claimed in any one of claims"! 3 to 15 which is additionally transformed with a recombinant 
DNA vector capable of expressing the S protein of HBV. 

45 17. A method of producing a modified L protein of any one of claims 1 to 4 comprising culturing a cell 
according to any one of claims 13 to 15 in appropriate culture media and isolating said protein from the cell 
lysate or extract of said culture. 

18. A method of producing simultaneously a modified L protein and S protein comprising culturing a cell of 
claim 16 and isolating from the cell lysate or extract of said culture protein particles of mixed subunit 

so composition. 

19. A microorganism selected from the group of methylotrophic yeast genera carrying 

A) more than one copy of an expression cassette (ed) coding for a first polypeptide and/or one or more 
than one copy of an expression cassette (ec2) coding for a second polypeptide, optionally in addition one 
or more than one expression cassette (ec3) coding for a third polypeptide, 

55 or 

B) one copy of an expression cassette (ed) coding for a first polypeptide and one or more than one copy 
of an expression cassette (ec2) coding for a second polypeptide, optionally in addition one or more than 
one expression cassette (ec3) coding for a third polypeptide, 

62 



EP 0 414 374 A2 



wherein the expression cassettes comprise: 

(a) a regulon R which is responsive to methanol and/or depletion of catabolite repressing carbon sources; 

(b) an open reading frame coding for ail or part of a protein having the biological activity of one of the 
hepatitis B surface antigens; 

5 (c) optionally a DNA sequence serving as a terminator T; 

wherein R exerts control on the transcription of the open reading frame and T directs polyadenylation and/or 
termination of transcription of a produced mRNA. 

20. A microorganism as claimed in claim 19, characterised in that the first polypeptide represents all or part 
or parts of the S-protein and the second polypeptide represents all or part or parts of the preS1 -protein and 

70 the optional third polypeptide represents all or part or parts of the preS2-protein or the second polypeptide 
represents all or part or part's of the preS2-protein. 

21. A microorganism according to claim 19 or claim 20, characterised in that the methylotrophic yeast is 
selected from a species of the genera Candida , Kloeckera , Saccharomyces , Rhodotorula , Torulopsis , 
Pichia , or Hansenula . 

75 22. A microorganism according to any one of claims 19 to 21, characterised in that the microorganism is 
derived from Hansenula , preferably Hansenula polymorpha , most preferably from a strain having the 
identifying characteristics of Hansenula polymorpha RB 1 0 (DSM 521 5). 

23. A microorganism according to any one of claims 19 to 22, characterised in that the regulon is derived 
from a gene involved in methanol utilisation, preferably a MOX-gene, FMD gene, DAS gene or catalase 

20 gene, wherein said genes are most preferably derived from Hansenula polymorpha . 

24. A methylotrophic yeast organism having the identifying characteristics of strain DSM 5215. 

25. A DNA molecule containing an expression cassette (ed) coding for a first polypeptide and/or an 
expression cassette (ec2) coding for a second polypeptide and/or an expression cassette (ec3) coding for a 
third polypeptide, wherein the expression cassettes comprise: 

25 a) a regulon R which is responsive to methanol and/or depletion of catabolite-repressing carbon sources: 

b) an open reading frame coding for all or part of a protein having the biological activity of one of the 
hepatitis B surface antigens; 

c) optionally a DNA sequence serving as a terminator T 

wherein R exerts control on the transcription of the open reading frame and T directs polyadenylation and/or 
30 termination of transcription of a produced mRNA. 

26. A DNA molecule comprising an URA3 gene, characterised in that said URA3 gene is derived from 
Hansenula polymorpha . 

27. A vector suitable for transformation of a methylotrophic yeast, characterised in that the vector comprises 
a DNA molecule according to claim 25 or claim 26. 

35 28. A process for transforming a methylotrophic yeast host organism with a first vector carrying a first 
expression casette and optionally a second vector carrying a second expression casette and optionally a 
third vector containing a third expression casette and wherein at least some of the expression casettes are 
integrated by recombination with the chromosome of said yeast host organism as a multimeric repeat, 
which process comprises the steps of: 

40 a) transforming the host with the first vector and growing it initially thereafter under one selective 
medium, followed by incubation under non-selective conditions for several generations; 

b) selecting mitotically stable transformants containing more than one expression cassette coding for a 
first polypeptide; 

c) optionally curing said transformants of residual autonomously replicating plasmids; 

45 d) optionally transforming said transformants with the second vector and growing them initially on 
selective medium, followed by incubation under non-selective conditions for several generations; 
e) selecting mitotically stable transformants containing more than one expression cassette coding for the 
first polypeptide and optionally one or more than one expression cassette coding for a second 
polypeptide; 

50 f) optionally curing said transformants of residual autonomously replicating plasmids; 

g) optionally repeating steps d) to f) using one of the previously mentioned vectors or a third vector 
carrying a third expression casette coding for a third polypeptide. 

Claims for the following Contracting State: ES 

55 

1. A method of producing a modified hepatitis B virus large surface protein comprising an amino acid 
sequence encoding the L protein characterized by a modification in at least one of the following sequences: 
a sequence sensitive to protease digestion, a sequence necessary for myristylation. a sequence necessary 
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for N-linked glycosylate, a sequence necessary for O-linked glycosylate, and a sequence necessary for 
binding of human serum albumin, which method comprises culturing a host cell comprising a DNA coding 
sequence for expression of the modified L protein in appropriate culture media and isolating said protein 
from the cell lysate or extract of said culture. 
5 2. A method according to claim 1 wherein the modified L protein comprises a primary translated amino acid 
sequence lacking a glycine residue in the second position thereof. 

3. A method according to claim 1 wherein the modified L protein has any sequence of Asn-X-Thr or Asn-X- 
Ser, X being any amino acid, altered by deletion or replacement of Asn, deletion or replacement of Thr or 
Ser, or deletion of X. 

to 4. A method according to claim 1 or claim 2 in which the modified L protein comprises an amino acid 
sequence comprising residues 12-52, or residue 12 followed by residues 14-52, followed by residues 133- 
145 followed by residues 175-400 of the L protein. 

5. A method according to any one of Claims 1 to 4 in which the host cell is selected from the group of 
yeasts consisting of Saccharomyces , Hansenula , Pichia , Candida , Kluyveromyces and Schizosac- 

rs charomyces . 

6. A method according to claim 5 in which the yeast cell comprises S . cerevisiae or Hansenula polymorpha 

7. A method as claimed in any one of claims 1 to 6 in which the modified L protein is expressed in the form 
of a particle simultaneously with the S-protein of hepatitis B surface antigen, which method comprises 

20 culturing a host cell which has been additionally transformed with a recombinant DNA vector capable of 
expressing the said S-protein and isolating from the cell lysate or extract protein particles of mixed subunit 
composition. 

8. A method as claimed in claim 7 in which the particle of mixed subunit composition has the formula (L*. S) 
wherein L* denotes a modified L protein of hepatitis B virus having an amino acid sequence comprising 

25 residues 12-52 followed by residues 133-145 followed by residues 175-400 of the L protein and S denotes 
the S-protein of hepatitis B surface antigen. 

9. A method as claimed in claim 7 or claim 8 in which the S protein is of ad subtype and the modified L 
protein is of ay subtype or vice versa . 

10. A process for the preparation of a transformed microorganism selected from the group of 
30 methylotrophic yeast genera, which process comprises transforming the said microorganism with a vector 

comprising a DNA molecule containing an expression cassette (ec1) coding for a first polypeptide and/or an 
expression cassette (ec2) coding for a second polypeptide and/or an expression cassette (ec3) coding for a 
third polypeptide, wherein the expression cassettes comprise: 

a) a regulon R which is responsive to methanol and/or depletion of catabolite-repressing carbon sources; 
35 b) an open reading frame coding for all or part of a protein having the biological activity of one of the 
hepatitis B surface antigens; 

c) optionally a DNA sequence serving as a terminator T 

wherein R exerts control on the transcription of the open reading frame and T directs polyadenylation and/or 
termination of transcription of a produced mRNA. 
40 11. A process as claimed in claim 10, characterised in that the first polypeptide represents all or part of the 
S-protein and the second polypeptide represents all or part of the preS1 -protein and the optional third 
polypeptide represents all or part of the preS2-protein or the second polypeptide represents all or part of 
the preS2-protein. 

12. A process as claimed in claim 10 or claim 11 in which the methylotrophic yeast is selected from a 
45 species of the genera Candida , Kloeckera , Saccharomyces , Rhodotorula , Torulopsis , Pichia , or 

Hansenula . 

13. A process as claimed in any one of claims 10 to 12 in which the regulon is derived from a gene 
involved in methanol utilisation, preferably a MOX-gene, FMD gene, DAS gene or catalase gene, wherein 
said genes are most preferably derived from Hansenula polymorpha . 

so 14. A process as claimed in any one of claims 10 to 13 in which the methylotrophic yeast is selected from 
Hansenula , preferably Hansenula polymorpha , most preferably from a strain having the identifying 
characteristics of Hansenula polymorpha RB 10 (DSM 5215). 

15. A method for the preparation of a composite particle, characterised in that a transformed methylotrophic 
yeast strain, prepared according to the process of any one of claims 10 to 14 is cultivated in a suitable 

55 culture medium under conditions where protein synthesis occurs and the proteins are isolated from the 
culture. 

16. A process for preparing a vaccine against hepatitis B infection, which process comprises admixing the 
protein or particle prepared according to the method of any one of Claims 1 to 9 or 15 with a suitable 
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